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FIELD OF THE INVENTION 

The present invention relates to single-chain multimeric polypeptides and polypep- 
5 tide conjugates, in particular to multimeric cytokine polypeptides comprising at least two 
monomelic units of a cytokine polypeptide of a type that normally is biologically active in 
monomelic form. 

BACKGROUND OF THE INVENTION 

10 It is well known that biopharmaceuticals such as granulocyte colony stimulating 

factor, erythropoietin, interferons, interleukins, growth hormones and factors and coagula- 
tion factors mediate their effect on cells by binding to specific receptors on cell surfaces, 
whereby cytokine-receptor complexes are formed that activate certain signal transduction 
pathways in the cells. The cytokines bind to their receptors as monomers or as multimers, 

15 e.g. dimers, trimers or tetramers. 

Many cytokine pharmaceuticals and other injected protein pharmaceuticals suffer 
from the problem of an insufficient plasma half-life in the body, so that injections must be 
made relatively frequently in order to maintain a sufficient level of the protein in vivo. This 
is disadvantageous since it is both inconvenient for the patient and costly, and it is therefore 

20 often desirable to be able to increase the plasma half -life of protein pharmaceuticals. Vari- 
ous approaches have been used with the aim of increasing the plasma half -life of such pro- 
teins. Since it is known that increasing the size of the protein generally tends to increase the 
plasma half-life, several of these approaches have involved increasing the molecular weight 
by various means. 

25 One such approach has been to increase the molecular weight by formation of 

polypeptide conjugates, for example by conjugation with a polymer such as polyethylene 
glycol (PEG). This process, called "PEGylation", has been applied to a wide variety of pro- 
teins and has the advantage of often resulting in a reduced antigenicity. PEGylation has in 
some cases, however, led to a reduction in the biological activity of the PEGylated molecule. 

30 Another approach for increasing the molecular weight of a protein is to link two or 

more proteins together either via a chemical linkage or via a peptide bond or peptide linker. 
This has most often been applied to conjugates of two different proteins, e.g. a protein of 
interest fused to human albumin or another abundant plasma protein. Further, there are a 
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number of disclosures of various single-chain antibody constructs or single-chain multimeric 
antigen constructs used for vaccine purposes. 

Certain single-chain polypeptide variants of cytokines that are biologically active in 
vivo only in dimeric form are disclosed in the literature. 
5 Li et al. (J. Biol Chem. 271(4): 1817-20, Jan. 1996; and Biol Chem. 271(49): 

31729-31734, Dec. 1996) disclose single-chain interleukin 5 (scIL-5) and mutants thereof 
produced in order to study receptor binding. Human interleukin 5 is biologically active as a 
disulfide-linked homodimer, and the scIL-5 mutants of Li et al. were based on sclL-5 con- 
structed by linking two IL-5 monomers with a Gly-Gly linker. 

10 Leong et al. {Protein Sci. 6(3): 609-17, Mar. 1997) disclose single-chain dimers of 

interleukin 8 linked via a peptide linker and with a single disulfide crosslink to prevent for- 
mation of multimers. The single-chain BL-8 dimers were designed to mimic the dimeric form 
of IL-8 in solution with the aim of producing heterodimer IL-8 variants. 

Randal et al. (Protein Sci. 4:1057-60, Apr. 1998) disclose a single-chain variant of 

15 human interferon-gamma (IFN-y) in which two IFN-y chains are linked with an eight residue 
polypeptide linker. Landar et al. (J. Mol Biol. 299(1): 169-79, May 2000) disclose a bio- 
logically active single-chain mutant of IFN-y produced by linking the two peptide chains of 
IFN-y with a seven-residue linker and by mutating Hisl 1 1 in the first chain. 

WO 98/27230 discloses various methods for polypeptide engineering using 

20 recursive sequence recombination (RSR), including single-chain versions of multisubunit 
factor proteins. 

Multimeric polypeptides with polypeptide units joined by a cleavable linkage have 
also been disclosed for the purpose of producing monomelic polypeptides. For example, 
WO 00/17336 discloses a DNA cassette comprising two or more tandem repeating units of a 

25 nucleotide sequence encoding a biologically active short peptide attached to a linker peptide 
cleavable by a protease or a chemical agent. Expression of the DNA cassette results in a 
multimeric polypeptide that is subsequently cleaved enzymatically or chemically to result in 
the desired biologically active peptide. 

US 5,218,093 discloses biologically active variants of epidermal growth factor 

30 (EGF) comprising at least two monomeric EGF units that may be linked either by direct C- 
terminus to N-terminus fusion or through a cleavage-insensitive peptide linker. Human EGF 
is a monomeric protein having 53 amino acid residues, and the problem addressed by US 
5,218,093 is that of preventing undesired intracellular degradation by native proteases found 
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in the recombinant microbial hosts used to produce recombinant EGF. According to this 
document, this avoids the need to produce recombinant EGF as a fusion protein bound to a 
stabilizing carrier, which requires subsequent downstream processing to release the EGF in 
monomeric form, or to produce EGF in multimeric form with repeating units linked through 
5 a cleavage-sensitive site that is subsequently digested using a site-specific enzyme. It is dis- 
closed that the multimeric EGF has biological activity characteristic of monomeric EGF, but 
no other advantages aside from those specifically related to the production process are de- 
scribed. 

US 5,684,136 discloses a method for receptor activation by providing a conjugate 
10 comprising the direct fusion of a first ligand and a second ligand capable of binding to first 
and second receptors, and contacting the conjugate with the first and second receptors. The 
conjugate is in particular a chimeric molecule comprising a dimer of hepatocyte growth fac- 
tor (HGF), a disulfide-linked heterodimer comprising a 69 kDa alpha subunit and a 34 kDa 
beta subunit. 

15 WO 99/38891 discloses modified polypeptides with increased biological activity, in 

particular modified erythropoietin (EPO), in the form of multimeric polypeptides with poly- 
peptide units covalently linked by thioester bonds. 

WO 99/02710 discloses recombinant fusion protein multimers with altered biologi- 
cal activity such as increased plasma half-life, comprising two or more protein molecules 

20 fused directly or via a peptide linker, in particular modified EPO. This document teaches 
against conjugating proteins to chemical compounds or inert molecules as a way to increase 
biological activity, as this is said to often result in a significant decrease in the overall bio- 
logical activity. 

WO 92/06116 discloses a recombinant hematopoietic molecule comprising at least 
25 a portion of a first hematopoietic molecule having early myeloid differentiation activity and 
at least a portion of a second hematopoietic molecule having late myeloid differentiation 
activity, the recombinant hematopoietic molecule having early myeloid differentiation activ- 
ity associated with the first hematopoietic molecule and late myeloid differentiation activity 
associated with the second hematopoietic molecule. The first hematopoietic molecule may 
30 be IL-3 or GM-CSF, and the second hematopoietic molecule may be EPO, G-CSF, IL-5 or 
M-CSF. 

WO 01/03737 (published 18 Jan. 2001) discloses fusion proteins comprising a cy- 
tokine or growth factor fused to an immunoglobulin domain, in particular IgG. Also dis- 
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closed are multimeric fusion proteins comprising two or more members of the growth hor- 
mone superfamily joined with or without a peptide linker, although no such multimeric fu- 
sion proteins are actually exemplified. 

Paige et al. (Pharm. Res. 12(12):1883-8, Dec. 1995) disclose prolonged circulation 
5 of recombinant human granulocyte-colony stimulating factor (rhG-CSF) by covalently link- 
ing rhG-CSF to serum albumin through a heterobifunctional maleimido-carboxyl polyethyl- 
ene glycol. 

The present invention provides single-chain multimeric polypeptides with advanta- 
geous biological properties based on monomelic units of a polypeptide of a type that nor- 
10 mally is biologically active in monomelic form, in particular based on polypeptide units that 
in their native form are non-glycosylated or have a relatively low degree of glycosylation. 

BRIEF DISCLOSURE OF THE INVENTION 

In its broadest aspect, the present invention relates to single-chain multimeric poly- 

15 peptides and polypeptide conjugates, in particular polypeptides and conjugates exhibiting 
agonist activity, comprising at least two monomelic units of a polypeptide of a type that is 
biologically active in monomelic form, as well as methods for preparation of the polypep- 
tides and their use in medical treatment. 

Accordingly, one aspect of the invention relates to single-chain multimeric poly- 

20 peptide conjugate comprising at least two units of a monomelic polypeptide linked via a 
peptide bond or a peptide linker, wherein the monomelic polypeptide is of a type that is bio- 
logically active in monomeric form, and having at least one polymer moiety covalently 
bound to an attachment group of said polypeptide. 

Another aspect of the invention relates to a single-chain multimeric polypeptide 

25 conjugate comprising at least two units of a monomeric polypeptide linked via a peptide 
bond or a peptide linker, wherein the monomeric polypeptide is of a type that is biologically 
active in monomeric form, and wherein at least one of said units differs from the corre- 
sponding wild-type polypeptide in that at least one amino acid residue comprising an at- 
tachment group for a non-polypeptide moiety has been introduced or removed, and having at 

30 least one non-polypeptide moiety covalently bound to an attachment group of said polypep- 
tide. 

A further aspect of the invention relate to nucleotide sequences encoding the single- 
chain polypeptides, expression vectors and host cells comprising such a nucleotide sequence 
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and a method for preparing the polypeptides and conjugates by recombinant DNA tech- 
niques. 

In still further aspects, the invention relates to a pharmaceutical composition com- 
prising a polypeptide or conjugate of the invention together with a pharmaceutically accept- 
5 able excipient or vehicle, the use in therapy of a single-chain polypeptide or conjugate of the 
invention, and a method of treating a mammal using a single-chain polypeptide or conjugate 
of the invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 

10 Figure 1 shows the in vivo biological activity of single-chain G-CSF dimers in 

healthy rats. 

Figure 2 shows the in vivo biological activity of single-chain G-CSF dimers in rats 
with chemotherapy-induced neutropenia. 

15 DETAILED DISCLOSURE OF THE INVENTION 

Definitions 

In the context of the present specification and claims, the following definitions ap- 
ply: 

The term "polypeptide" is understood to indicate a mature protein or a precursor 
20 form thereof as well as a functional fragment thereof which essentially has retained the ac- 
tivity of the mature protein, i.e. exhibits at least the same qualitative activity and preferably 
also at least a similar quantitative activity as the mature protein. A functional fragment may 
for instance be an N- and/or C-terminal truncated form of a full-length polypeptide, or an 
isoform, in particular a native isoform, of a full-length polypeptide. 
25 The monomeric subunits of the polypeptides of the invention are derived from or 

otherwise made so as to mimic the structure and function of parent polypeptides which in 
their native form are biologically active as monomers, i.e. the parent polypeptides do not 
normally require formation of dimers, trimers, etc. in order to be biologically active in vivo. 
The term "derived" is intended to indicate that the monomeric polypeptide subunit 
30 is prepared to mimic structural and/or functional properties of the corresponding native 

polypeptide in question. The "derived" polypeptide may have the same amino acid sequence 
as said native polypeptide, or it may have one or more amino acid changes, e.g. those made 
in accordance with the present invention. Thus, typically, the amino acid sequence of the 
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"derived" polypeptide is at least 60% identical (homologous) to that of said native polypep- 
tide, normally at least 70% identical, such as at least 80% or at least 90% or 95% identical. 
Preferably, the monomelic polypeptides used in the present invention are of mammalian 
origin, in particular of human origin. It will be understood that derived polypeptides include 
5 synthetic polypeptides with the necessary structural and/or functional similarity to a native 
polypeptide. 

The terms "homology" and "identity" as used in connection with amino acid se- 
quences are used in their conventional meanings. Amino acid sequence homology/identity is 
conveniently determined from aligned sequences (aligned by use of the CLUSTAL W Mul- 

10 tiple Sequence Alignment Program, version 1.8, June 1999 (Thompson et al. (1994) 
CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment 
through sequence weighting, positions-specific gap penalties and weight matrix choice, Nu- 
cleic Acids Research, 22:4673-4680) using default parameters) or provided from the PFAM 
families database version 4.0 (http://pfam.wustl.edu/) {Nucleic Acids Res. 1999 Jan 1; 

15 27(l):260-2) by use of GENEDOC version 2.5 (Nicholas, K.B., Nicholas H.B. Jr., and Deer- 
field, D.W. II. 1997 GeneDoc: Analysis and Visualization of Genetic Variation, EMB- 
NEW.NEWS 4:14; Nicholas, K.B. and Nicholas H.B. Jr. 1997 GeneDoc: Analysis and 
Visualization of Genetic Variation). 

The term "parent polypeptide" is used about the monomelic polypeptide, which in 

20 accordance with the present invention is provided in a modified single-chain form compris- 
ing two or more monomers. The individual monomers are linked by peptide bonds, option- 
ally through a linker peptide, rather than being linked by non-covalent bonds or disulfide 
bonds. Accordingly, the single-chain polypeptides of the invention are expressed as a single 
polypeptide from a single nucleotide sequence rather than being expressed as single mono- 

25 mer molecules which are assembled to an oligomeric polypeptide only after expression. The 
parent monomer may be a wild-type monomelic polypeptide or a variant thereof, for in- 
stance a mutein form of the wild-type monomelic polypeptide which has been prepared by 
substitution or deletion of one or more amino acid residues thereof and/or insertion of one or 
more additional amino acid residues therein. 

30 In the present context, the term "multimeric" or "oligomeric" polypeptide is merely 

intended to indicate that the polypeptide comprises two or more monomelic units that are 
linked together, (i.e. to form a dimer, trimer, tetramer., etc.) since, strictly speaking, the sin- 
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gle-chain polypeptides of the invention cannot be said to be multimeric. The terms "dimer", 
"dimeric", "trimer", "trimeric", etc. are used in the same manner. 

The term "signalling polypeptide" is used to denote a polypeptide that interacts 
with a cellular receptor so as to activate the receptor and thereby provide a signal initiating a 
5 signal transduction cascade in the cell carrying the receptor. Such a polypeptide is often also 
termed a ligand. In the present context, an "agonist" is a molecule which is capable of bind- 
ing to a desired receptor to result in an activated receptor complex. An "antagonist" on the 
other hand is a molecule which is capable of binding to a desired receptor but incapable of 
mediating correct conformational changes of the receptor molecules necessary to result in an 

10 activated complex, whereby ligand-mediated receptor activation is substantially inhibited. 
Receptor activation upon binding of a suitable ligand generally involves conformational 
change in the receptor, e.g. oligomerisation of receptor subunits. A polypeptide of the inven- 
tion "having agonist activity" refers to the fact that the multimeric polypeptides are able to 
bind to and activate at least one cellular receptor via at least one monomelic subunit, where 

15 said subunit typically is derived from or identical to a native agonist polypeptide having a 
similar agonist activity. Although the invention is primarily directed to polypeptides and 
conjugates that have agonist activity, it is also contemplated that the principles of the inven- 
tion will be equally applicable to single-chain multimeric antagonist polypeptides, i.e. in 
which at least one monomelic unit of the polypeptide is able to bind to a target receptor 

20 without activating the receptor. 

The term "ligand-binding domain" refers to the part or parts of a cellular receptor 
which is/are involved in specific recognition of and interaction with a receptor-binding site 
of an endogenous ligand. Analogously, the "receptor-binding site" is understood as those 
amino acid residues which are involved in polypeptide binding to the ligand-binding domain 

25 of the receptor. Normally, the receptor-binding site comprises 1-50 amino acid residues, 
such as 5-30 or 10-25 amino acid residues. The amino acid residues in question may be lo- 
cated in sequence, but are more often placed in spatial proximity to each other as a result of 
the folding of the polypeptide. In the Materials and Methods section below, a method is de- 
scribed for determining residues of a receptor-binding site. 

30 The term "receptor" is understood to indicate a protein present on a cell surface 

which binds signalling molecules (i.e. ligands) as the first step in triggering the signal trans- 
duction cascade. Cell surface receptors are typically composed of different domains with 
different functions, such as an extracellular ligand-binding domain with which the signalling 



WO 02/36626 



PCT/DK01/00724 



8 

polypeptide interacts to initiate signal transduction, a transmembrane domain (or in some 
cases, several transmembrane domains) which anchors the receptor in the cell membrane, 
and an intracellular effector domain which generates a cellular signal in response to ligand 
binding (signal transduction). 
5 The term "conjugate" (or interchangeably "conjugated polypeptide") is intended to 

indicate a composite or chimeric molecule formed by the covalent attachment of one or 
more polypeptides to one or more non~peptide moieties. The term "covalent attachment" 
means that the polypeptide and the non-peptide moiety are either directly covalently joined 
to one another, or else are indirectly covalently joined to one another through an intervening 

10 moiety or moieties, such as a bridge, spacer, or linkage moiety or moieties. Preferably, the 
conjugated polypeptide is soluble at relevant concentrations and conditions, i.e. soluble in 
physiological fluids such as blood. Examples of conjugated polypeptides of the invention 
include glycosylated polypeptides and PEGylated polypeptides. The term "non-conjugated 
polypeptide" can be used about the polypeptide part of the conjugate. 

15 The term "non-peptide moiety" is intended to indicate a molecule which is not a 

peptide and which is capable of conjugating to an attachment group of the polypeptide of the 
invention. Preferred examples of such molecule include polymers, e.g. polyalkylene oxide, 
oligosaccharide moieties, lipophilic groups, e.g. fatty acids, and ceramides. The "polymer 
molecule" is a molecule formed by covalent linkage of two or more monomers, wherein 

20 none of the monomers is an amino acid residue, except where the polymer is human albumin 
or another abundant plasma protein. The term "polymer" may be used interchangeably with 
the term "polymer molecule". Except where the number of polymer molecules is expressly 
indicated, every reference to "a polymer", "a polymer molecule", "the polymer" or "the 
polymer molecule" contained in a polypeptide of the invention or otherwise used in the pre- 

25 sent invention shall be a reference to one or more polymer molecules. 

The term "oligosaccharide moiety" is intended to indicate a carbohydrate- 
containing molecule comprising two or more monosaccharide residues, and which is capable 
of being attached to the polypeptide to produce a polypeptide conjugate in the form of a gly- 
cosylated polypeptide by way of in vivo or in vitro glycosylation. The term "in vivo glycosy- 

30 lation" is intended to mean any attachment of an oligosaccharide moiety occurring in vivo, 
i.e. during posttranslational processing in a glycosylating cell used for expression of the 
polypeptide, e.g. by way of N-linked and O-linked glycosylation. Usually, the N- 
glycosylated oligosaccharide moiety has a common basic core structure composed of five 
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monosaccharide residues, namely two N-acetylglucosamine residues and three mannose 
residues. The exact oligosaccharide structure depends, to a large extent, on the glycosylating 
organism in question and on the specific polypeptide. Depending on the host cell in question 
the glycosylation may be classified as a high mannose type, a complex type or a hybrid type. 
5 The term "zrc vitro glycosylation" is intended to refer to a synthetic glycosylation performed 
in vitro, normally involving covalently linking an oligosaccharide moiety to an attachment 
group of a polypeptide, optionally using a cross-linking agent. In vivo and in vitro glycosyla- 
tion are discussed in detail further below. 

Although an oligosaccharide may normally be considered to be a polymer, in the 

10 context of the present specification and claims, "oligosaccharide moieties" are considered 
separately from "polymer molecules" as defined above, i.e. the term "polymer moiety" as 
used herein is not intended to encompass oligosaccharide moieties. It will be clear to persons 
skilled in the art in light of the present specification, however, that a polypeptide conjugate 
of the invention comprising one or more oligosaccharide moieties bound to an attachment 

15 group of a polypeptide, e.g. by way of in vivo glycosylation, may in addition comprise one 
or more polymer moieties bound to an attachment group of the polypeptide, e.g. by way of 
PEGylation. 

In one embodiment, the multimeric polypeptide conjugate of the invention is one in 
which at least one monomelic polypeptide unit is non-glycosylated or is glycosylated at one, 

20 two or three amino acid residues, the glycosyl moieties typically having a molecular weight 
of about 1-10 kDa, more typically 1-6 kDa.. In many cases, this will apply to all of the 
monomelic units. Normally, the monomelic polypeptide units of the invention will be de- 
rived from monomelic polypeptides that in their native form are non-glycosylated or have a 
relatively low degree of glycosylation (a low degree of glycosylation being understood in 

25 the present context as glycosylation at not more than three sites on a monomelic polypep- 
tide). 

An "N-glycosylation site" has the sequence N-X'-S/T/C-X", wherein X' is any 
amino acid residue except proline, X' ' is any amino acid residue that may or may not be 
identical to X' and preferably is different from proline, N is asparagine and S/T/C is either 
30 serine, threonine or cysteine, preferably serine or threonine, and most preferably threonine. 
An "O-glycosylation site" is the OH-group of a serine or threonine residue. 

The term "attachment group" is intended to indicate a functional group of the poly- 
peptide, in particular of an amino acid residue thereof or an oligosaccharide moiety, capable 



WO 02/36626 PCT/DKO 1/00724 

10 

of attaching a non-peptide moiety such as a polymer molecule, a lipophilic molecule or an 
organic derivatizing agent. Useful attachment groups and their matching non-peptide moie- 
ties are apparent from the table below. 



Attachment 
group 


Amino acid 


Examples of non- 
peptide moiety 


Conjugation 
method/- 
Activated PEG 


Reference 


-NH 2 


N-terminal, 
Lys 


Polymer, e.g. PEG, 
with amide or imine 
group 


mPEG-SPA 
Tresylated mPEG 


Shearwater Corp. 
Delgado et al., 
critical reviews in 
Therapeutic Drug 
Carrier Systems 
9(3,4):249-304 
(1992) 


-COOH 


C-terminal, 
Asp, Glu 


Polymer, e.g. PEG, 
with ester or amide 
group 

Oligosaccharide 
moiety 


mPEG-Hz 

In vitro coupling 


Shearwater Corp. 


-SH 


Cys 


Polymer, e.g. PEG, 
with disulfide, 
maleimide or vinyl 
sulfone group 
Oligosaccharide 
moiety 


PEG- 

vinylsulphone 
PEG-maleimide 

In vitro coupling 


Shearwater Corp. 
Delgado et al., 
critical reviews in 
Therapeutic Drug 
Carrier Systems 
9(3,4):249-304 
(1992) 


-OH 


Ser, Thr, 
OH-, Lys 


Oligosaccharide 
moiety 

PEG with ester, 
ether, carbamate, 
carbonate 


In vivo O-linked 
glycosylation 




-CONH 2 


Asn as part 
of an N- 
glycosyla- 
tion site 


Oligosaccharide 
moiety 

Polymer, e.g. PEG 


In vivo N- 
glycosylation 




Aromatic 
residue 


Phe, Tyr, 
Trp 


Oligosaccharide 
moiety 


In vitro coupling 




-CONH 2 


Gin 


Oligosaccharide 
moiety 


In vitro coupling 


Yan and Wold, 
Biochemistry, 
1984, Jul 31; 
23(16): 3759-65 


Aldehyde 
Ketone 


Oxidized 
oligo- 
saccharide 


Polymer, e.g. PEG, 
PEG-hydrazide 


PEGylation 


Andresz et al., 
1978, Makromol. 
Chem. 179:301, 
WO 92/16555, 
WO 00/23114 
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Guanidino 


Arg 


Oligosaccharide 
nioieiy 


In vitro coupling 


Lundblad and 
i>oyes, i^neijuicdj. 
Reagents for Pro- 
tein Modification, 
CRC Press Inc., 
Florida, USA 


Imidazole 
ring 


His 


Oligosaccharide 
moiety 


In vitro coupling 


As for guanidine 



For in vivo N-glycosylation, the term "attachment group" is used to indicate the 
amino acid residues that together constitute an N-glycosylation site. Although the asparagine 
residue of the N-glycosylation site is where the oligosaccharide moiety is attached during 
5 glycosylation, such attachment cannot be achieved unless the other amino acid residues of 
the N-glycosylation site are present. Accordingly, when the non-peptide moiety is an oligo- 
saccharide moiety and the conjugation is to be achieved by N-glycosylation, the term 
"amino acid residue comprising an attachment group for the non-peptide moiety" as used in 
connection with alterations of the amino acid sequence of the polypeptide of interest is to be 

10 understood as meaning that one or more amino acid residues constituting an N-glycosylation 
site are to be altered in such a manner that either a functional N-glycosylation site is intro- 
duced into the amino acid sequence or removed from said sequence. 

The term "comprising an attachment group" is intended to mean that the attachment 
group is present on an amino acid residue (including an N-glycosylation site) of the relevant 

15 peptide or polypeptide or on an oligosaccharide moiety attached to said peptide or polypep- 
tide. 

Amino acid names and atom names (e.g. CA, CB, NZ, N, O, C, etc.) are used 
herein as defined by the Protein DataBank (PDB) (www.pdb.org), which is based on the 
IUPAC nomenclature (IUPAC Nomenclature and Symbolism for Amino Acids and Peptides 

20 (residue names, atom names, etc.), Eur. J. Biochem., 138, 9-37 (1984) together with their 
corrections in Eur. J. Biochem., 152, 1 (1985). The term "amino acid residue" is intended to 
indicate an amino acid residue contained in the group consisting of the 20 naturally occur- 
ring amino acids, i.e. alanine (Ala or A), cysteine (Cys or C), aspartic acid (Asp or D), glu- 
tamic acid (Glu or E), phenylalanine (Phe or F), glycine (Gly or G), histidine (His or H), 

25 isoleucine (lie or I), lysine (Lys or K), leucine (Leu or L), methionine (Met or M), aspar- 
agine (Asn or N), proline (Pro or P), glutamine (Gin or Q), arginine (Arg or R), serine (Ser 
or S), threonine (Thr or T), valine (Val or V), tryptophan (Trp or W) and tyrosine (Tyr or 
Y). The terminology used for identifying amino acid positions/substitutions is illustrated as 
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follows: T13 in a given amino acid sequence indicates position number 13 occupied by a 
threonine residue. T13K indicates that the threonine residue of position 13 has been substi- 
tuted by a lysine residue. Multiple substitutions are indicated with a "+", e.g. S93N+G95S/T 
means an amino acid sequence which comprises substitution of the serine residue in position 
5 93 by an asparagine residue and substitution of the glycine residue in position 95 by a serine 
or a threonine residue. 

The term "nucleotide sequence" is intended to indicate a consecutive stretch of two 
or more nucleotide molecules. The nucleotide sequence may be of genomic, cDNA, RNA, 
semi-synthetic or synthetic origin, or any combination thereof. 

10 "Cell", "host cell", "cell line" and "cell culture" are used interchangeably herein 

and all such terms should be understood to include progeny resulting from growth or cultur- 
ing of a cell. "Transformation" and "transfection" are used interchangeably to refer to the 
process of introducing DNA into a cell. 

"Operably linked" refers to the covalent joining of two or more nucleotide se- 

15 quences in such a manner that the normal function of the sequences can be performed. For 
example, the nucleotide sequence encoding a presequence or secretory leader is operably 
linked to a nucleotide sequence for a polypeptide if it is expressed as a preprotein that par- 
ticipates in the secretion of the polypeptide: a promoter or enhancer is operably linked to a 
coding sequence if it affects the transcription of the sequence. 

20 The term "introduce" is primarily intended to mean substitution of an existing 

amino acid residue, but may also mean insertion of an additional amino acid residue. The 
term "remove" is primarily intended to mean substitution of the amino acid residue to be 
removed by another amino acid residue, but may also mean deletion (without substitution) 
of the amino acid residue to be removed. 

25 The term "reduced immunogenicity" is intended to indicate that the conjugate gives 

rise to a measurably lower immune response than a reference molecule, e.g. a wild-type 
polypeptide, as determined under comparable conditions. The immune response may be a 
cell or antibody mediated response (see, e.g., Roitt: Essential Immunology (8th Edition, 
Blackwell) for further definition of immunogenicity). Normally, reduced antibody reactivity 

30 will be an indication of reduced immunogenicity. The reduced immunogenicity may be de- 
termined by use of any suitable method known in the art, e.g. in vivo or in vitro. 

The term "functional in vivo half -life" is used in its normal meaning, i.e. the time at 
which 50% of the biological activity of the polypeptide or conjugate is still present in the 
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body/target organ, or the time at which the activity of the polypeptide or conjugate is 50% of 
the initial value. As an alternative to determining functional in vivo half -life, "serum half- 
life" may be determined, i.e. the time in which 50% of the polypeptide or conjugate mole- 
cules circulate in the plasma or bloodstream prior to being cleared. Determination of serum 
5 half-life is often more simple than determining the functional in vivo half-life, and the mag- 
nitude of serum half-life is usually a good indication of the magnitude of functional in vivo 
half-life. Alternative terms to serum half-life include "plasma half-life", "circulating half- 
life", "serum clearance", "plasma clearance" and "clearance half-life". The polypeptide or 
conjugate is cleared by the action of one or more of the reticuloendothelial systems (RES), 

10 kidney, spleen or liver, by receptor-mediated degradation, or by specific or non-specific pro- 
teolysis. For injected protein pharmaceuticals, clearance is believed to be primarily deter- 
mined by the level of renal clearance and receptor-mediated clearance, while e.g. non- 
specific proteolysis is believed to be of secondary importance. Normally, clearance depends 
on size (relative to the cutoff for glomerular filtration), charge, attached carbohydrate chains, 

15 the presence of cellular receptors for the protein and the affinity of the protein towards its 
receptor(s). The functional in vivo half -life and the serum half -life may be determined by 
any suitable method known in the art, e.g. by the method disclosed in US 5,824,778. 

The term "increased" as used about the functional in vivo half-life or serum half-life 
is used to indicate that the relevant half -life of the conjugate or polypeptide is statistically 

20 significantly increased relative to that of a reference molecule, e.g. a corresponding wild- 
type or non-conjugated monomelic polypeptide, as determined under comparable condi- 
tions. For instance, the relevant half-life may increased by at least 25%, such as by at least 
50%, by at least 100%, or by at least 500% or 1000%. 

The term "renal clearance" is used in its normal meaning to indicate any clearance 

25 taking place by the kidneys, e.g. by glomerular filtration, tubular excretion or tubular elimi- 
nation. Renal clearance depends on physical characteristics of the conjugate, including mo- 
lecular weight, size (diameter), symmetry, shape/rigidity and charge. Usually, a molecular 
weight of roughly about 66-67 kDa is considered to be a cut-off-value for renal clearance 
(although this can vary depending on e.g. the diameter and shape of the molecule, i.e. the 

30 "apparent size" as determined e.g. by SDS-PAGE). A reduced renal clearance may be estab- 
lished by any suitable assay, e.g. an established in vivo assay. Typically, the renal clearance 
is determined by administering a labelled (e.g. radiolabeled or fluorescence labelled) poly- 
peptide conjugate to a patient and measuring the label activity in urine collected from the 
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patient. The reduced renal clearance is determined relative to the corresponding non- 
conjugated polypeptide or the corresponding non-conjugated wild-type polypeptide under 
comparable conditions. 

The term "function" is intended to indicate one or more specific functions of the 
5 polypeptide of interest and is generally to be understood qualitatively (i.e. having a similar 
function as the polypeptide of interest) and not necessarily quantitatively (i.e. the magnitude 
of the function is not necessarily similar). In the present context, the specific function of 
interest will in particular be one or more biological activities, e.g. in vitro or in vivo bioactiv- 
ity. 

10 The interchangeably used terms "measurable function" and "functional" are in- 

tended to indicate that the relevant function (preferably reflecting the intended use) of a con- 
jugate of the invention may be detected when measured by standard methods known in the 
art, e.g. as in vitro and/or in vivo bioactivity. Typically, if not otherwise stated herein, a 
measurable function is at least 1%, such as at least 2% or 5%, preferably at least 10%, such 

15 as at least 25% or 50%, of that of the non-conjugated polypeptide as determined under com- 
parable conditions, e.g. in the range of 1-1000%, such as 5-500% or 10-200% of the func- 
tion of the non-conjugated polypeptide. 

The interchangeably used terms "native" and "wild-type" are used about a polypep- 
tide which has an amino acid sequence that is identical to one found in nature. The native 

20 polypeptide is typically isolated from a naturally occurring source, in particular a mammal- 
ian or microbial source, such as a human source, or is produced recombinantly by use of a 
nucleotide sequence encoding the naturally occurring amino acid sequence. The term "na- 
tive" is intended to encompass allelic variants of the polypeptide in question. A "variant" is 
a polypeptide which has an amino acid sequence that differs from that of a native polypep- 

25 tide in one or more amino acid residues. The variant is typically prepared by modification of 
a nucleotide sequence encoding the native polypeptide (e.g. to result in substitution, deletion 
or truncation of one or more amino acid residues of the polypeptide) or by introduction (by 
addition or insertion) of one or more amino acid residues into the polypeptide) so as to mod- 
ify the amino acid sequence constituting said native polypeptide. A "fragment" is a part of a 

30 parent native or variant polypeptide, typically differing from such parent in one or more C- 
terminal or N-terminal amino acid residues or both types of such residues. Normally, the 
variant or fragment has retained at least one of the functions of the corresponding parent 



WO 02/36626 



PCT/DK01/00724 



15 

polypeptide, although as indicated above, the function of a variant or fragment need not be 
quantitatively comparable to that of the parent polypeptide. 

Polypeptides and conjugates of the invention 

In the context of the present description and claims, any reference to a polypeptide 
5 that is "biologically active in monomelic form" is intended to mean a cytokine, hormone, 
growth factor or other polypeptide with therapeutic activity that in its native form and in its 
native environment is mainly found in monomelic form in solution, and where the native 
monomelic polypeptide is biologically active in vivo. This includes polypeptides where only 
one monomer is required for biological activity (typically polypeptides where binding of a 

10 single polypeptide elicits receptor activation) as well as polypeptides that occur as a mono- 
mer in vivo in physiological fluids, but where two, three or possibly more individual mono- 
mers together are responsible for receptor activation. An example of a polypeptide in this 
latter category is G-CSF, since native hG-CSF occurs in vivo as a monomer. Activation of 
the G-CSF receptor takes place via binding of two G-CSF molecules to a pair of G-CSF 

15 receptors, forming a 2:2 complex between the two receptor domains and the ligand, even 
though the two G-CSF molecules do not form a dimer as such (Aritomi et al., Nature, Vol. 
401: 713-717, 14 Oct. 1999). 

The monomelic polypeptides outlined above can be contrasted to polypeptides that 
normally occur in vivo in the form of e.g. dimers or trimers, for example with monomelic 

20 units bound together by disulfide bonds or hydrogen bonding, and where the formation of 
such multimers is a prerequisite for receptor binding or other biological activity. Polypep- 
tides in this category can be either homomers, e.g. a homodimer comprising two identical 
monomers, or heteromers, e.g. a heterodimer comprising two different monomers. 

One of the advantages of the single-chain polypeptides and conjugates of the inven- 

25 tion based on polypeptides that are biologically active in monomeric form is that the single- 
chain multimeric polypeptides will normally have a reduced clearance and a longer circula- 
tion half-life than the corresponding monomeric polypeptides. This is due to the higher mo- 
lecular weight of the multimeric polypeptide as well as possible attachment of non- 
polypeptide moieties. 

30 The biological activity provided by the multimeric polypeptides and conjugates of 

the invention is in particular agonist activity, i.e. binding of a polypeptide of the invention to 
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its receptor results in receptor activation, thereby providing a signal initiating a signal trans- 
duction cascade in the cell carrying the receptor. 

Cytokines that in their native form have a monomelic structure in solution and are 
thus suitable for use in the single-chain polypeptides and conjugates of the invention include 
5 interleukins such as interleukin- 1 alpha (IL-la), interleukin- lbeta (IL-lp), interleukin-lra 
(IL-lra), interleukin-2 (IL-2), interleukin-3 (IL-3), interleukin-4 (IL-4), interleukin-6 (IL-6), 
interleukin-7 (IL-7), interleukin-8 (IL-8), interleukin-9 (IL-9), interleukin-11 (IL-11), inter- 
leukin-13 (IL-13), interleukin-15 (IL-15), interleukin 17 (DL-17) and interleukin 18 (IL-18); 
colony stimulating factors such as granulocyte colony stimulating factor (G-CSF) and granu- 

10 locyte macrophage colony stimulating factor (GM-CSF); growth factors such as epidermal 
growth factor (EGF) and stem cell growth factor; interferons such as interferon alpha 
(INF- a) and interferon beta (INF- (3); members of the tumour necrosis family such as tumour 
necrosis factor alpha (TNF-oc), tumour necrosis factor beta (TNF-(3) and osteoprotegerin 
ligand (OPGL); as well as e.g. erythropoietin (EPO) and human growth hormone. 

15 Although the present invention is primarily directed to single-chain multimeric 

polypeptides based on polypeptides that are biologically active in monomelic form, it is also 
contemplated that the principles described herein will be applicable to polypeptides of the 
type that exist as dimers, trimers and other multimers in vivo. In this case, creating e.g. a 
single-chain dimeric polypeptide based on a polypeptide that normally exists and is active as 

20 a dimer in vivo might be of little significance in terms of the molecular weight of the poly- 
peptide itself (with the possible exception of added weight provided by a relatively large 
linker sequence). However, creating single-chain versions of such polypeptides provides the 
possibility to individually modify the different monomers, e.g. by adding and/or removing 
attachment sites for non-polypeptide moieties. It is therefore contemplated that the princi- 

25 pies of the present invention also can be applied to polypeptides which are biologically ac- 
tive in multimeric form to result in single-chain versions of such polypeptides wherein the 
individual monomers are linked by a peptide bond or a peptide linker, and in particular 
wherein one or more of the monomers have an amino acid sequence that is modified, in rela- 
tion to the native sequence, by the introduction and/or removal of at least one attachment 

30 site for a non-polypeptide moiety. Examples of such polypeptides that are biologically active 
in multimeric form include IL-5 (homodimer), IL-10 (homodimer), EL- 12 (heterodimer), EL- 
16 (homodimer), interferon gamma (dimer), vascular endothelial growth factor (VEGF; 
homodimer), and human fertility hormones such as follicle stimulating hormone (FSH; het- 
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erodimer). As explained below, it has been found that single-chain polypeptides having a 
reduced in vitro bioactivity (reduced receptor binding affinity) also show an increased in 
vivo half-life that can be attributed to a lower rate of receptor-mediated clearance (RMC). In 
a particular embodiment, the invention therefore relates to a general method for reducing 
5 receptor-mediated clearance of a polypeptide, compared to the relevant wild-type polypep- 
tide, by producing the polypeptide in the form of a single-chain construct comprising two or 
more monomelic units linked by a peptide bond or peptide linker. This general method is 
contemplated to be applicable to polypeptides that in their native form are biologically ac- 
tive as monomers as well as to polypeptides that are biologically active as multimers. 

10 While the polypeptides used according to the invention may be of any origin, they 

are normally of mammalian origin, in particular of human origin. 

The multimeric polypeptides and conjugates of the invention comprise two or more 
monomeric polypeptide units with similar biological activity, in particular two or more 
monomelic polypeptide units that have substantially the same biological activity and which 

15 may be substantially similar in sequence. The term "similar biological activity" refers to the 
fact that the two or more monomeric units should be of the same type of molecule and have 
the same type of biological activity or at least be derived from a molecule with the same 
type of biological activity. Although the individual monomeric units may in some cases 
have an identical amino acid sequence, it will be clear that the individual sequences need not 

20 be identical and that the level of their biological activity need not be identical. To illustrate 
using G-CSF (granulocyte colony stimulating factor) as an example, a dimeric polypeptide 
according to the invention may comprise, e.g., two identical monomers of wild-type G-CSF, 
a wild-type G-CSF monomer and a variant in which one or more amino acids have been 
inserted and/or removed relative to the wild-type G-CSF monomer, two G-CSF variants 

25 (which may be identical or different from each other), a wild-type G-CSF monomer and a 
fragment thereof, a G-CSF variant and a fragment of the variant or of the wild-type mono- 
mer, etc. The same principles for combining different monomeric units are of course also 
valid for other multimers such as trimers, tetramers, pentamers, etc. In the case of a trimer, 
for example, it may comprise three identical monomeric units, two identical monomeric 

30 units (wild-type or variant) and one unit which is different from the two (e.g. a variant or 
fragment), or three different units. When the individual monomers are identical the polypep- 
tide is termed a "homomer", and when they are different from each other the polypeptide is 
termed a "heteromer". 



WO 02/36626 



PCT/DK01/00724 



18 

The monomers used for constructing the multimeric polypeptide may be linked by a 
peptide bond, or may be connected by a suitable linker peptide. If used, the linker peptide 
should be of a type (length, amino acid composition, amino acid sequence, etc.) that is ade- 
quate to link two (or more) monomers in such a way that they assume a conformation rela- 
5 tive to one another so that the resulting multimeric polypeptide has the desired activity. Fur- 
thermore, the linker peptide is typically designed to increase the stability of the resulting 
multimeric polypeptide towards proteolytic degradation, e.g. by use of special amino acid 
sequences or residues not readily subject to proteolysis. When the multimeric polypeptide is 
intended for conjugation to a non-polypeptide moiety, the peptide linker sequence may 

10 comprise one or more attachment groups for said non-polypeptide moiety. For instance, 
when the non-polypeptide moiety is an oligosaccharide moiety the linker can contain a se- 
quence that provides an N-glycosylation site. When the non-polypeptide moiety is PEG, the 
linker can contain e.g. Lys or Cys. 

The linker peptide will often predominantly include the amino acid residues Gly, 

15 Ser, Ala and/or Thr. The linker typically comprises 1-30 amino acid residues, such as a se- 
quence of about 2-20 or 3-15 amino acid residues. Likewise, the amino acid residues se- 
lected for inclusion in the linker peptide should exhibit properties that do not interfere sig- 
nificantly with the activity of the multimeric polypeptide. Thus, the linker peptide should on 
the whole not exhibit a charge which would be inconsistent with the desired activity of the 

20 multimeric polypeptide, or interfere with internal folding, or form bonds or other interac- 
tions with amino acid residues in one or more of the monomers which would seriously im- 
pede the binding of the multimeric polypeptide to the ligand-binding domain of the receptor. 

Specific linkers for use in the present invention may be designed on the basis of 
known naturally occurring as well as artificial polypeptide linkers (see, e.g., Hallewell et al. 

25 (1989), J. Biol Chem. 264, 5260-5268; Alfthan et al. (1995), Protein Eng. 8, 725-731; 
Robinson & Sauer (1996), Biochemistry 35, 109-116; Khandekar et al. (1997), J. Biol 
Chem. 272, 32190-32197; Fares et al. (1998), Endocrinology 139, 2459-2464; Smallshaw et 
al. (1999), Protein Eng. 12, 623-630; US 5,856,456). For instance, linkers used for creating 
single-chain antibodies, e.g. a 15mer consisting of three repeats of a Gly-Gly-Gly-Gly-Ser 

30 amino acid sequence ((G^Sefh), are contemplated to be useful in the present invention. 
Other linkers that contemplated to be useful in the present invention are GlySerThrSerGly- 
SerSerGlyLysSerSerGluGlyLysGly, and GlyGlyGlyGlySerGlyGlyGlyAsnSerThrGlyGly- 
GlySer, the latter being an example of a linker providing a glycosylation site (AsnSerThr). 
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Furthermore, phage display technology as well as selective infective phage technology can 
be used to diversify and select appropriate linker sequences (Tang et al., J. Biol. Chem. 271, 
15682-15686, 1996; Hennecke et al. (1998), Protein Eng. 11, 405-410). Also, Arc repressor 
phage display has been used to optimize the linker length and composition for increased 
5 stability of the single-chain protein (Robinson and Sauer (1998), Proc. Natl. Acad. Sci. USA 
95, 5929-5934). 

Another way of obtaining a suitable linker is by optimizing a simple linker, e.g. 
((Gly4Ser) n ), through random mutagenesis. It will be clear from the present specification that 
whatever the nature of the linker, it should be one which is not readily susceptible to cleav- 

10 age by e.g. proteases or chemical agents, since cleavage of the multimeric polypeptide to 
result in two or more monomelic units is not desired in the present context. 

As indicated above, in one embodiment the multimeric polypeptide conjugate com- 
prises two or more monomelic polypeptide units with the same amino acid sequence, and 
with at least one non-polypeptide moiety attached to an attachment group thereof. In this 

15 case, the multimeric polypeptide may comprise two or more wild-type monomelic units 

linked together so as to obtain a multimeric polypeptide having a sufficiently high molecular 
weight, i.e. without any monomelic units that differ from the wild-type polypeptide, other 
than the presence of one or more attached non-polypeptide moieties. Alternatively, the poly- 
peptide may comprise two or more units that are modified, in relation to the relevant wild- 

20 type amino acid sequence, by insertion and/or removal of one or more amino acid residues, 
e.g. by substitution of one or more amino acid residues in the native sequence with a non- 
native amino acid residue. 

In another embodiment, the multimeric polypeptide comprises two or more mono- 
melic polypeptide units with different amino acid sequences. As indicated above, this can 

25 e.g. for the case of a dimer be a wild-type unit and a modified unit or two modified units that 
are different from each other. A single-chain multimeric polypeptide with non-identical 
monomeric units provides the advantage that the individual units can be designed to give 
different desired characteristics to the multimeric polypeptide. For example, the amino acid 
sequence of one of the monomeric units of a dimeric polypeptide of the invention can be 

30 designed for optimal binding to a receptor, while the amino acid sequence of the other 
monomeric unit can be designed to provide optimal attachment sites for non-polypeptide 
moieties in order to obtain desired properties of molecular weight, bulk, etc. and thus desired 
properties in terms of e.g. half -life. 
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It will be clear that at least one of the monomelic units of the multimeric polypep- 
tide must be biologically active, i.e. capable of binding to the intended target receptor with a 
sufficient binding affinity to elicit a desired receptor activation. In some cases it may be suf- 
ficient that only one of the monomelic units is biologically active, so that the other mono- 
5 meric unit(s) has/have a significantly reduced agonist activity or binding affinity or perhaps 
substantially no activity or binding affinity, although it is generally preferred that both (or 
all) of the monomelic units are biologically active and are able to bind to and activate the 
intended receptor. For a multimeric polypeptide of the invention having at least first and 
second monomelic subunits, wherein the first monomelic subunit is biologically active as 

10 defined above, this means that the second monomelic subunit typically has an amino acid 
sequence homology to the first monomelic subunit of at least about 70%, preferably at least 
about 80%, more preferably at least about 85%, still more preferably at least about 90%, 
such as at least about 95%. The same applies to any additional monomelic unit in the case of 
a trimer or higher multimer. The degree of homology may conveniently be determined using 

15 the CLUSTAL W sequence alignment program referred to above. 

Further, it will be clear that the individual monomelic subunits of the single-chain 
multimeric polypeptide will, in the case of a fragment of a full-length polypeptide, have a 
certain minimum length in relation to the relevant wild-type polypeptide. Thus, each mono- 
melic subunit will typically have a length, determined as the number of amino acid residues 

20 compared to the number of amino acid residues in the corresponding wild-type polypeptide 
(e.g. the relevant wild-type human polypeptide) of at least about 50%, preferably at least 
about 60%, more preferably at least about 70%, still more preferably at least about 80%, 
such as at least about 90% or 95% of the length of the wild-type polypeptide. 

By removing and/or introducing one or more amino acid residues comprising an at- 

25 tachment group for a non-polypeptide moiety, it is possible to specifically adapt the poly- 
peptide so as to make the molecule more susceptible to conjugation to the non-polypeptide 
moiety of choice, to optimize the conjugation pattern (e.g, to ensure an optimal distribution 
of non-polypeptide moieties on the surface of the molecule and to ensure that only the at- 
tachment groups intended to be conjugated are present in the molecule) and thereby obtain a 

30 conjugate molecule with improved activity and/or other properties, e.g. reduced immuno- 
genicity. 

In embodiments with one or more modified monomelic units, the amino acid se- 
quence of the wild-type polypeptide and the amino acid sequence of the monomelic unit of 
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the invention may differ in that at least one and preferably more, e.g. 2-15, amino acid resi- 
dues comprising an attachment group for a non-polypeptide moiety have been introduced 
into the monomeric unit, preferably by substitution, compared to the wild-type amino acid 
sequence. Thereby, for instance, shielding by non-polypeptide moieties may be achieved in 
5 more or different regions of the polypeptide molecule, leading to a lower immune response, 
and/or the molecular weight, shape, size and/or charge of the conjugate may be optimised. 
Preferably, such amino acid residue is introduced in an amino acid located at the surface of 
the polypeptide, more preferably in a position occupied by an amino acid residue having 
more than 25%, such as more than 50% or even more than 75% of its side chain exposed at 

10 the surface of the molecule. As used herein, the term "surface-exposed" refers to an amino 
acid residue with a side chain that is at least partially exposed at the surface of the molecule, 
in particular at least 25% exposed. 

A method for determining the percentage exposed surface area of side chains of 
amino acid residues, and thus for identifying suitable positions for modification, based on an 

15 analysis of the 3D structure of the polypeptide, is given in the Materials and Methods sec- 
tion below. Alternatively or additionally, the position to be modified may be identified on 
the basis of an analysis of the sequence family of the polypeptide in question. More specifi- 
cally, the position to be modified can be one that in one or more members of the family 
other than the parent polypeptide is occupied by an amino acid residue comprising the rele- 

20 vant attachment group (when such amino acid residue is to be introduced) or which in the 
parent polypeptide, but not in one or more other members of the family, is occupied by an 
amino acid residue comprising the relevant attachment group (when such amino acid residue 
is to be removed). 

In order to determine an optimal distribution of attachment groups, the distance be- 
25 tween amino acid residues located at the surface of the polypeptide is calculated on the basis 
of the polypeptide's 3D structure. More specifically, the distance between the CB's of the 
amino acid residues comprising such attachment groups, or the distance between the func- 
tional group (NZ for lysine, CG for aspartic acid, CD for glutamic acid, SG for cysteine) of 
one and the CB of another amino acid residue comprising an attachment group are deter- 
30 mined. In case of glycine, CA is used instead of CB. In polypeptides according to the inven- 
tion, any of said distances is preferably more than 8A, in particular more than 10A, in order 
to avoid or reduce heterogeneous conjugation. 
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In another embodiment, one difference between the amino acid sequence of the 
wild-type polypeptide and the amino acid sequence of the monomeric unit used herein is 
that one or preferably more, e.g. 2-15, amino acid residues comprising an attachment group 
for a non-polypeptide moiety have been removed, preferably by substitution, from the wild- 
5 type amino acid sequence. The amino acid residue to be removed is preferably one to which 
conjugation is disadvantageous, e.g. an amino acid residue located at or near a functional 
site of the polypeptide (since conjugation at such a site may result in inactivation or reduced 
activity of the resulting conjugate due to impaired receptor recognition). In the present con- 
text the term "functional site" is intended to indicate one or more amino acid residues which 

10 is/are essential for or otherwise involved in the function or performance of the polypeptide, 
in particular receptor binding and/or activation. The functional site may be determined by 
methods known in the art and is preferably identified by analysis of a structure of the poly- 
peptide complexed to a relevant receptor. 

In preferred embodiments of the present invention, more than one amino acid resi- 

15 due of at least one monomeric unit is altered, e.g. the alteration embraces removal as well as 
introduction of amino acid residues comprising an attachment group for the non-polypeptide 
moiety of choice. In a particular embodiment, the amino acid sequence of the monomeric 
polypeptide unit may differ from the relevant wild-type amino acid sequence in that a) at 
least one amino acid residue comprising an attachment group for the non-polypeptide moi- 

20 ety and present in the wild-type amino acid sequence has been removed, preferably by sub- 
stitution, and b) at least one amino acid residue comprising an attachment group for the non- 
polypeptide moiety has been introduced into the amino acid sequence, preferably by substi- 
tution. This embodiment is considered of particular interest in that it is possible to specifi- 
cally design the polypeptide so as to obtain an optimal conjugation to the non-polypeptide 

25 moiety of choice. For instance, by introducing and removing selected amino acid residues, 
e.g. as exemplified below for G-CSF, it is possible to ensure an optimal distribution of at- 
tachment groups for the non-polypeptide moiety of choice, which gives rise to a conjugate in 
which the non-polypeptide moieties are placed so as to effectively shield epitopes and other 
surface parts of the polypeptide without substantially impairing the function of the polypep- 

30 tide. 

In one alternative of this embodiment, it is possible to produce a multimeric poly- 
peptide according to the invention in which, taking a dimer as an example, one of the 
monomeric units comprises either a wild-type sequence or a variant sequence modified such 
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that one or more attachment sites for a non-polypeptide moiety have been removed as com- 
pared to the wild-type, thereby avoiding attachment of undesired non-polypeptide moieties 
to this monomelic unit (although it may, if desired, maintain any naturally occurring glyco- 
sylation sites so as to allow glycosylation corresponding to the native polypeptide), while 
5 the other monomelic unit comprises a sequence with one or more attached non-polypeptide 
moieties. In this case, the latter monomelic unit may have an amino acid sequence corre- 
sponding to that of the wild-type, but with one or more polymer moieties attached thereto, 
e.g. PEG moieties, or it may be a variant with introduced and/or removed attachment groups 
for polymer and/or oligosaccharide moieties. The advantage of this approach is that the re- 

10 suiting multimeric polypeptide comprises a monomelic unit corresponding in sequence and 
possible glycosylation to the native polypeptide together with a monomelic unit modified 
with one or more attachment groups, and possibly with additional amino acid modifications, 
so as to give the overall monomelic polypeptide the desired properties in terms of biological 
activity, molecular weight, half-life, epitope shielding, etc. 

15 In addition to the removal and/or introduction of amino acid residues, the polypep- 

tide may comprise other substitutions or glycosylations which are not related to introduction 
and/or removal of amino acid residues comprising an attachment group for the non- 
polypeptide moiety. 

In order to avoid too much disruption of the structure and function of the parent 
20 molecule, the total number of amino acid residues to be altered in accordance with the pre- 
sent invention, e.g. as exemplified for G-CSF below, will typically not exceed 15. The exact 
number of amino acid residues and the type of amino acid residues to be introduced and/or 
removed depend, La., on the desired nature and degree of conjugation (e.g. the identity of 
the non-polypeptide moiety, how many non-polypeptide moieties it is desirable or possible 
25 to conjugate to the polypeptide, where in the polypeptide conjugation should be performed 
or avoided, etc.). Preferably, the polypeptide part of the conjugate of the invention or the 
polypeptide of the invention comprises an amino acid sequence which differs in 1-15 amino 
acid residues from the native amino acid sequence such as in 1-8 or 2-8 amino acid residues, 
e.g. in 1-5 or 2-5 amino acid residues. Thus, normally the polypeptide part of the conjugate 
30 or the polypeptide of the invention comprises an amino acid sequence which differs from the 
native amino acid sequence in 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14 or 15 amino acid 
residues. 
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The amino acid residue comprising an attachment group for a non-polypeptide 
moiety, whether it is removed or introduced, is selected on the basis of the nature of the non- 
polypeptide moiety of choice and, in most instances, on the basis of the method in which 
conjugation between the polypeptide and the non-polypeptide moiety is to be achieved. It 
5 will be understood that in order to preserve a measurable function of the conjugate or poly- 
peptide, amino acid residues to be modified (by deletion, preferably by substitution) are se- 
lected from those amino acid residues which are not essential for providing a measurable 
activity. Accordingly, amino acid residues to be modified are different from those required 
for receptor binding or activation. 
10 Amino acid residue modifications in one or more monomelic units of the mul- 

timeric polypeptides of the invention are preferably selected from among the following: 

• introduction of a lysine residue, typically substitution in place of a non-lysine resi- 
due, 

• removal of a lysine residue, in particular introduction of an arginine or glutamine 
15 residue in place of a lysine residue, 

• introduction of a cysteine residue, typically substitution in place of a non-cysteine 
residue, 

• introduction of an aspartic acid residue, typically substitution in place of a non- 
aspartic acid residue, 

20 • introduction of a glutamic acid residue, typically substitution in place of a non- 
glutamic acid residue, 

• removal of a cysteine, aspartic acid or glutamic acid residue, typically substitution by 
another amino acid residue, 

• introduction and/or removal of a histidine residue, typically by substitution, 
25 • introduction of an N- or O-glycosylation site, and 

• removal of an N- or O-glycosylation site. 

The conjugate of the present invention will normally have one or more improved 
properties as compared to the native polypeptide, including increased functional in vivo half- 
life, increased serum half-life, reduced clearance, reduced immunogenicity and/or increased 
30 bioavailability. Consequently, medical treatment with a conjugate of the invention offers a 
number of advantages over the currently available compounds, including longer duration 
between injections and fewer side effects. 
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As explained above, the increased functional in vivo half-life is normally obtained 
as a consequence of the conjugate of the invention having a reduced susceptibility to renal 
and/or receptor-mediated clearance as compared to the native monomeric polypeptide, in 
part as a result of an increased molecular weight. It should be noted in this regard that the 
5 actual molecular weight does not necessarily have to be above any certain limit, e.g. 60 or 
65 kDa, in order to achieve reduced renal clearance, since renal clearance depends not only 
on the molecular weight as such, but also on e.g. on the three-dimensional structure and bulk 
of the conjugate. Thus, for purposes of reduced renal clearance, it is sufficient that the "ap- 
parent size" of the conjugate, e.g. as determined by SDS-PAGE, is sufficiently high, e.g. at 

10 least about 60 or 65 kDa, such as at least about 66 or 67 kDa. Polymer molecules such as 
PEG have been found to be particularly useful for adjusting the molecular weight of the con- 
jugate and for providing a sufficiently high apparent size. In conjugates of the invention, 
each monomeric polypeptide unit will often have a molecular weight (exclusive of any 
polymer moiety bound to said unit) of less than about 34 kDa, such as less than about 30 

15 kDa. 

In one embodiment, the multimeric polypeptide conjugate of the invention thus 
comprises at least one non-polypeptide moiety bound to an attachment group of at least one 
of the monomeric polypeptide units, such that the apparent size of the multimeric polypep- 
tide conjugate is at least about 60 kDa, such as at least about 65 kDa, e.g. at least about 67 
20 kDa. 

Generally, activation of the receptor is associated with receptor-mediated clearance 
(RMC) such that binding of a polypeptide to its receptor without activation does not lead to 
RMC, while activation of the receptor leads to RMC. The clearance is due to internalisation 
of the receptor-bound polypeptide with subsequent lysosomal degradation. Reduced RMC 

25 may be achieved by designing the conjugate so as to be able to bind and activate a sufficient 
number of receptors to obtain optimal in vivo biological response and avoid activation of 
more receptors than required for obtaining such response. This may be reflected in reduced 
in vitro bioactivity and/or increased off-rate. 

Typically, reduced in vitro bioactivity reflects reduced efficacy/efficiency and/or 

30 reduced potency and may be determined by any suitable method for determining any of 
these properties. For instance, in vitro bioactivity may be determined in a lucif erase based 
assay (see Materials and Methods). 
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For G-CSF it has been found that a relatively low in vitro bioactivity, compared to 
the bioactivity of hG-CSF (SEQ ID NO:l), is advantageous in terms of both a long plasma 
half-life and a high degree of stimulation of neutrophils. This is explained in further detail 
below. 

5 Single-chain G-CSF polypeptides and conjugates 

A preferred single-chain polypeptide conjugate of the invention is one comprising 
two or more subunits of a G-CSF polypeptide, e.g. two, three or four subunits, although 
normally the multimeric polypeptide will contain two subunits. In the case of two subunits, 
these may be two monomers each comprising the amino acid sequence of hG-CSF as shown 

10 in SEQ ID NO: 1, a monomer having the sequence of hG-CSF together with a monomer hav- 
ing an amino acid sequence that is altered relative to hG-CSR, or two monomers with al- 
tered amino acid sequences relative to hG-CSF. 

One embodiment of this aspect of the invention thus relates to a single-chain mul- 
timeric polypeptide having G-CSF activity, comprising at least two monomeric units inde- 

15 pendently selected from (a) hG-CSF with the amino acid sequence shown in SEQ ID NO:l 
and (b) variants of hG-CSF, said monomeric units being linked via a peptide bond or a pep- 
tide linker, wherein the polypeptide has at least one non-polypeptide moiety covalently 
bound to an attachment group of the polypeptide. 

Another embodiment of this aspect of the invention relates to a single-chain mul- 

20 timeric polypeptide having G-CSF activity, comprising at least two monomeric units inde- 
pendently selected from (a) hG-CSF with the amino acid sequence shown in SEQ ID NO:l 
and (b) variants of hG-CSF, said monomeric units being linked via a peptide bond or a pep- 
tide linker, wherein the polypeptide has at least one non-polypeptide moiety covalently 
bound to an attachment group of the polypeptide and exhibits an in vitro bioactivity in the 

25 range of about 2-30% of the bioactivity of non-conjugated hG-CSF as determined by the 
luciferase assay described herein. 

In a further embodiment, the invention relates to a single-chain multimeric polypep- 
tide having G-CSF activity, comprising at least two monomeric units independently selected 
from (a) hG-CSF with the amino acid sequence shown in SEQ ID NO:l and (b) variants of 

30 hG-CSF, said monomeric units being linked via a peptide bond or a peptide linker, the poly- 
peptide comprising at least one covalently bound polymer molecule selected from the group 
consisting of linear and branched polyalkylene oxides. 
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In a particular embodiment, the invention relates to a single-chain multimeric poly- 
peptide having G-CSF activity, comprising at least two monomelic units independently se- 
lected from (a) hG-CSF with the amino acid sequence shown in SEQ ID NO:l and (b) vari- 
ants of hG-CSF, said monomelic units being linked via a peptide bond or a peptide linker, 
5 the polypeptide comprising at least one covalently bound polyethylene glycol molecule. 

Unless otherwise indicated, the term "G-CSF" as used herein is intended to refer to 
polypeptides comprising the amino acid sequence of wild-type human G-CSF as set forth in 
SEQ ID NO:l as well as variants thereof with one or more changes in the form of substitu- 
tions, additions/insertions or deletions compared to SEQ ID NO:l. Such G-CSF variants are 
10 disclosed in detail below. 

The amino acid residue modifications in G-CSF variant monomelic units may in 
particular be selected from the group consisting of introduction of a lysine, cysteine, aspar- 
tic acid, glutamic acid or histidine residue, and removal of a lysine, cysteine, aspartic acid, 
glutamic acid or histidine residue. Alternatively or additionally, at least one of the mono- 
15 mers of the single-chain multimeric G-CSF polypeptide of this aspect of the invention may 
comprise at least one amino acid residue modification selected from the group consisting of 
introduction of an N- or O-glycosylation site, and removal of an O-glycosylation site. 

In another embodiment, the single-chain multimeric G-CSF polypeptide comprises 
at least two G-CSF polypeptide monomers linked via a peptide bond or a peptide linker, 
20 wherein at least one of said monomers is a variant of wild-type human G-CSF comprising at 
least one introduced attachment site for a polymer moiety. 

In a further embodiment, the single-chain multimeric G-CSF polypeptide comprises 
at least two G-CSF polypeptide monomers linked via a peptide bond or a peptide linker, 
wherein at least one of said monomers is a variant of wild-type human G-CSF wherein at 
25 least one attachment site for a non-polypeptide moiety has been introduced in a position that 
in wild-type human G-CSF is occupied by a surface-exposed amino acid residue. 

In a still further embodiment, the single-chain multimeric G-CSF polypeptide com- 
prises at least two G-CSF polypeptide monomers linked via a peptide linker, wherein the 
peptide linker comprises at least one amino acid residue comprising an attachment group for 
30 a non-polypeptide moiety. 

As described in the literature, the principal biological effect of G-CSF in vivo is to 
stimulate the growth and development of neutrophils (Welte et al., PNAS USA 82:1526- 
1530, 1985, Souza et al., Science, 232:61-65, 1986). The amino acid sequence of human G- 



WO 02/36626 



PCT/DK01/00724 



28 

CSF (hG-CSF) was reported by Nagata et al„ Nature 319:415-418, 1986, and is shown in 
the appended SEQ ID NO:l. 

Recombinant human G-CSF (rhG-CSF) is generally used for treating various forms 
of leukopenia/neutropenia, i.e. a reduced white blood cell count that arises e.g. as a result of 
5 chemotherapy. Since leukopenia is a serious side effect of chemotherapy that increases the 
risk of infection and can reduce the effectiveness of chemotherapy, it is important to be able 
to reduce the time period during which patients are subject to leukopenia. 

Commercial preparations of rhG-CSF are available under the names filgrastim 
(Gran® and Neupogen®), lenograstim (Neutrogin® and Granocyte®) and nartograstim 

10 (Neu-up®). Gran® and Neupogen® are non-glycosylated and produced in recombinant E, 
coli cells. Neutrogin® and Granocyte® are glycosylated and produced in recombinant CHO 
cells and Neu-up® is non-glycosylated with five amino acids substituted at the N-terminal 
region of intact rhG-CSF produced in recombinant E. coli cells. 

The commercially available rhG-CSF has a shoit-term pharmacological effect and 

15 must often be administered more than once per day for the duration of the leukopenic state. 
A molecule with a longer circulation half-life would decrease the number of administrations 
necessary to alleviate the leukopenia and prevent consequent infections. Given the potential 
for obtaining more optimal therapeutic hG-CSF levels with concomitant enhanced therapeu- 
tic effect using less fewer injections, there is clearly a need for improved hG-CSF-like mole- 

20 cules. 

As indicated above, it has been found that a relatively low in vitro G-CSF bioactiv- 
ity is advantageous. In a preferred embodiment, the in vitro bioactivity of a multimeric G- 
CSF conjugate of the invention is in the range of about 1-30%, preferably about 2-30%, of 
the bioactivity of non-conjugated hG-CSF as determined by the lucif erase assay described 

25 herein. The in vitro bioactivity of such conjugates is thus preferably reduced by at least 

about 70%, such as by at least about 75%, e.g. by at least about 80% or 85%, as compared to 
the in vitro bioactivity of hG-CSF, determined under comparable conditions. Expressed dif- 
ferently, the conjugate may have an in vitro bioactivity that is as small as about 1%, typi- 
cally at least about 2%, such as at least about 3%, 4% or 5%, of that of the corresponding 

30 non-conjugated hG-CSF polypeptide. For instance, the in vitro bioactivity may be in the 
range of about 2-30% of that of the reference polypeptide, e.g. about 3-25% or 4-20%, de- 
termined under comparable conditions. In cases where reduced in vitro bioactivity is desired 
in order to reduce receptor-mediated clearance, it will be clear that sufficient bioactivity to 
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obtain the desired receptor activation must be nevertheless be maintained, which is why the 
bioactivity should be at least about 1-2% of that of hG-CSF and preferably slightly higher as 
given above. 

In a preferred embodiment, a single-chain multimeric G-CSF conjugate is in the 
5 form of a dimer with either two wild-type monomelic units or with one or possibly two units 
which are variants of the wild-type human G-CSF (hG-CSF), and preferably attached to one 
or more PEG moieties. 

In addition to the above considerations regarding in vitro bioactivity, it has further 
been found that advantageous results are obtained when the apparent size (also referred to as 
10 the "apparent molecular weight" or "apparent mass") of the polypeptide conjugates of the 
invention, or at least a majority of such conjugates, is at least about 50 kDa, preferably at 
least about 55 kDa, more preferably at least about 60 kDa, e.g. at least about 66 kDa. This is 
believed to be due to the fact that renal clearance is substantially eliminated for conjugates 
having a sufficiently large apparent size. In the present context, the "apparent size" of a G- 
15 CSF conjugate or polypeptide is determined by SDS-PAGE as described in the examples 
section below. 

It will be understood that the apparent size in kDa of a conjugate or polypeptide is 
not necessarily the same as the actual molecular weight of the conjugate or polypeptide. 
Rather, the apparent size is a reflection of both the actual molecular weight and the overall 

20 bulk. Since, in most cases, attachment of one or more PEG groups or other non-polypeptide 
moieties will result in a relatively large increase of the bulk of the polypeptide to which such 
moieties are attached, the polypeptide conjugates of the invention will normally have an 
apparent size that exceeds the actual molecular weight of the conjugate. Therefore, in con- 
nection with renal clearance, a conjugate of the invention can easily exhibit properties char- 

25 acteristic of a polypeptide with a molecular weight above e.g. 50 kDa (corresponding to the 
apparent size) but have an actual molecular weight below 50 kDa. 

In a further preferred embodiment, the multimeric G-CSF conjugates of the inven- 
tion have both an apparent size of at least about 50 kDa and a reduced in vitro bioactivity 
(reduced receptor binding affinity) compared to hG-CSF as explained above. It has been 

30 found that such conjugates have both a low renal clearance as a result of the large apparent 
size and a low receptor-mediated clearance as a result of the low in vitro bioactivity (low 
receptor binding affinity). The overall result is excellent performance in terms of effective 
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stimulation of neutrophils together with a significantly increased in vivo half-life and thus a 
long duration of action that provides important clinical advantages. 

In the following, the invention will be illustrated by way of example with reference 
to different variants based on hG-CSF. For the sake of simplicity, the numbering of the pos- 
5 sible amino acid modifications below will be with reference to the known amino acid se- 
quence of the hG-CSF monomer as shown in SEQ ID NO:l. It will be understood that the 
modifications illustrated below may, according to the properties desired in any given case, 
be performed in either the C-terminal or N-terminal monomelic unit or both. Further details 
regarding G-CSF variants and methods for producing such variants are found in 
10 PCT/DK01/0001 1 and U.S. Ser. No. 09/760,008, which are hereby incorporated by refer- 
ence. 

Conjugate of the invention wherein the non~polypeptide moiety is attached to a lysine or the N- 
terniinal amino acid residue 

15 In a preferred embodiment the conjugate of the invention is one wherein the amino 

acid residue comprising an attachment group for the non-polypeptide moiety is a lysine resi- 
due and the non-polypeptide moiety is any molecule which has lysine as an attachment 
group. For instance, the non-polypeptide moiety may be a polymer molecule, in particular 
any of the molecules mentioned in the section entitled "Conjugation to a polymer molecule", 

20 and preferably selected from the group consisting of linear or branched polyethylene glycol 
or another polyalkylene oxide. Most preferably, the polymer molecule is a PEG such as 
mPEG-SPA (Shearwater Corp.) or oxycarbonyl-oxy-N-dicarboxyimide PEG (US 
5,122,614). 

25 i) Introduction of lysine residues 

In order to obtain a more extensive or differently distributed conjugation, it may be 
desirable to introduce at least one non-naturally occurring lysine residue, in particular in a 
position which is occupied by an amino acid residue having a side chain which is more than 
25% surface exposed and not part of a cystine or located at a receptor binding site. 

30 Accordingly, in one embodiment the conjugate of the invention is one which com- 

prises a non-polypeptide moiety having lysine as attachment group and a polypeptide com- 
prising an amino acid sequence that differs from the native sequence in that at least one ly- 
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sine residue has been introduced. Residues are in particular introduced such that they have 
more than 50% of the side chain surface exposed. 

In a dimeric G-CSF conjugate, one or both of the monomelic units may thus have 
an amino acid sequence that differs from the amino acid sequence shown in SEQ ID NO:l 
5 in at least one substitution selected from the group consisting of T1K, P2K, L3K, G4K, 
P5K, A6K, S7K, S8K, L9K, P10K, Q11K, S12K, F13K, L14K, L15K, E19K, Q20K, V21K, 
R22K, Q25K, G26K, D27K, A29K, A30K, E33K, A37K, T38K, Y39K, L41K, H43K, 
P44K, E45K, E46K, V48K, L49K, L50K, H52K, S53K, L54K, I56K, P57K, P60K, L61K, 
S62K, S63K, P65K, S66K, Q67K, A68K, L69K, Q70K, L71K, A72K, G73K, S76K, Q77K, 

10 L78K, S80K, F83K, Q86K, G87K, Q90K, E93K, G94K, S96K, P97K, E98K, L99K, 

G100K, P101K, T102K, D104K, T105K, Q107K, L108K, D109K, All IK, D112K, F113K, 
T115K, T116K, W118K, Q119K, Q120K, M121K, E122K, E123K, L124K, M126K, 
A127K, P128K, A129K, L130K, Q131K, P132K, T133K, Q134K, G135K, A136K, 
M137K, P138K, A139K, A141K, S142K, A143K, F144K, Q145K, R146K, R147K, S155K, 

15 H156K, Q158K, S159K, L161K, E162K, V163K, S164K, Y165K, R166K, V167K, L168K, 
R169K, H170K, L171K, A172K, Q173K and P174K. 

Examples of preferred amino acid substitutions include one or more of Q70K, 
Q90K, T105K, Q120K, T133K, R146K, R147K, S159K, R166K and R169K. 

The polypeptide conjugate of the invention having introduced and/or removed at 

20 least one lysine is preferably in vivo glycosylated, e.g. using naturally-occurring glycosyla- 
tion sites present in the polypeptide. However, in a particular embodiment the conjugate is 
one wherein the amino acid sequence of the polypeptide differs from that of the native poly- 
peptide in that at least one N-glycosylation site has been introduced and/or removed. Such 
introduced/removed sites may any of those described in the section entitled "Conjugate of 

25 the invention wherein the non-polypeptide moiety is an oligosaccharide moiety". 

ii) Removal of lysine residues 

In order to avoid conjugation to one or more of the lysine residues present in one or 
more of the monomelic units (since these may inactivate or severely reduce the activity of 
30 the resulting conjugate if they are located in the receptor-binding domain), it may be desir- 
able to remove at least one lysine residue. Accordingly, the conjugate according to this em- 
bodiment comprises at least one monomelic polypeptide unit comprising an amino acid se- 
quence that differs from the native amino acid sequence in the removal of at least one lysine 
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residue, in particular a lysine residue selected from those having more than 25% of their side 
chains surface exposed, preferably selected from those having more than 50% of their side 
chain surface exposed. 

The removal is preferably achieved by substitution by any other amino acid residue, 
5 in particular an arginine or a glutamine residue. 

hG-CSF contains four lysine residues, of which K16 is located in the receptor- 
binding domain and the others are located in positions 23, 34 and 40, respectively, all rela- 
tively close to the receptor-binding domain. Accordingly, one or both of the G-CSF mono- 
melic units may comprise an amino acid sequence which, in addition to or instead of having 

10 one or more of the substitutions to lysine listed above, is modified in relation to hG-CSF by 
removal of at least one of the amino acid residues selected from the group consisting of K16, 
K23, K34 and K40, in particular at least K16, the removal preferably being achieved by sub- 
stitution by any other amino acid residue, in particular an arginine or a glutamine residue. 
One or both of the subunits may thus have a single lysine residue removed, or all of the four 

15 native lysine residues removed, or two or three lysine residues removed, i.e. selected from 
the group consisting of: K16+K23; K16+K34; K16+K40; K16+K23+K34; K16+K23+K40; 
K16+K34+K40; K23+K34; K23+K40; K23+K34+K40; and K34+K40. 

The single-chain G-CSF polypeptide according to this aspect of the invention pref- 
erably comprises at least one monomelic unit having at least one of the substitutions se- 

20 lected from the group consisting of K16R, K16Q, K23R, K23Q, K34R, K34Q, K40R and 
K40Q, more preferably at least one of the substitutions K16R and K23R, whereby conjuga- 
tion of these residues can be avoided. Preferably, the polypeptide comprises at least one sub- 
stitution selected from the group consisting of K16R+K23R, K16R+K34R, K16R+K40R, 
K23R+K34R, K23R+K40R, K34R+K40R, K16R+K23R+K34R, K16R+K23R+K40R, 

25 K23R+K34R+K40R and K16R+K34R+K40R. These substitutions are likely to give rise to 
the least structural difference. 

In a preferred embodiment of this aspect of the invention, both of the monomelic 
units of the single-chain G-CSF polypeptide are modified by removal of one or more lysines 
as described above. The lysines that are removed may be the same in the two subunits or 

30 they may differ. If desired, removal of lysines in this embodiment may be accompanied by 
introduction of one or more lysines in one or both of the subunits, although it has been found 
that surprisingly advantageous results are obtained simply by removing one or more lysines 
from the two subunits, without any introduction of lysines, when lysine PEGylation is used. 
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Hi) Introduction and removal of lysine residues 

In one embodiment the conjugate of the invention comprises at least one introduced 
lysine residue and at least one removed lysine residue. 

5 

Conjugate of the invention having a non4ysine residue as an attachment group 

Based on the present disclosure the skilled person will be aware that amino acid 
residues comprising other attachment groups may be introduced into and/or removed from 
the native monomelic polypeptide using the same approach as that illustrated above with 

10 lysine residues. For instance, one or more amino acid residues comprising an acid group 
(glutamic acid or aspartic acid), asparagine, histidine, tyrosine or cysteine may be introduced 
into positions which are occupied by amino acid residues having surface exposed side 
chains, or removed (preferably by substitution by any other amino acid residue). Such modi- 
fied polypeptides may further be in vivo glycosylated. 

15 In the case of a single-chain G-CSF conjugate according to the invention, cysteine 

attachment groups may be provided by means of an amino acid sequence that differs, in at 
least one monomelic unit, from the amino acid sequence of hG-CSF shown in SEQ ID NO:l 
in at least one substitution selected from the group consisting of TIC, P2C, L3C, G4C, P5C, 
A6C, S7C, S8C, L9C, P10C, Q11C, S12C, F13C, L14C, L15C, E19C, Q20C, V21C, R22C, 

20 Q25C, G26C, D27C, A29C, A30C, E33C, A37C, T38C, Y39C, L41C, H43C, P44C, E45C, 
E46C, V48C, L49C, L50C, H52C, S53C, L54C, I56C, P57C, P60C, L61C, S62C, S63C, 
P65C, S66C, Q67C, A68C, L69C, Q70C, L71C, A72C, G73C, S76C, Q77C, L78C, S80C, 
F83C, Q86C, G87C, Q90C, E93C, G94C, S96C, P97C, E98C, L99C, G100C, P101C, 
T102C, D104C, T105C, Q107C, L108C, D109C, A111C, D112C, F113C, T115C, T116C, 

25 W118C, Q119C, Q120C, M121C, E122C, E123C, L124C, M126C, A127C, P128C, A129C, 
L130C, Q131C, P132C, T133C, Q134C, G135C, A136C, M137C, P138C, A139C, A141C, 
S142C, A143C, F144C, Q145C, R146C, R147C, S155C, H156C, Q158C, S159C, L161C, 
E162C, V163C, S164C, Y165C, R166C, V167C, L168C, R169C, H170C, L171C, A172C, 
Q173C and P174C. The receptor-binding domain of hG-CSF contains a cysteine residue in 

30 position 17 which does not take part in a cystine and which may therefore advantageously be 
removed in order to avoid conjugate of a non-polypeptide moiety to said cysteine. Although 
C17 may be substituted by any other amino acid residue, it is in particular substituted by a 
serine residue. 
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Examples of preferred substitutions according to this aspect of the invention in- 
clude R146C, R147C, R166C and R169C. 

A single-chain G-CSF conjugate according to the invention may also have one or 
more non-polypeptide moieties bound to an acid group or to the C-terminal amino acid resi- 
5 due, in particular to an aspartic acid or glutamic acid residue. In this case, the amino acid 
sequence of at least one of the monomeric units may differ from the amino acid sequence 
shown in SEQ ID NO:l in at least one substitution selected from the group consisting of 
T1D, P2D, L3D, G4D, P5D, A6D, S7D, S8D, L9D, P10D, Q11D, S12D, F13D, L14D, 
L15D, K16D, Q20D, V21D, R22D, K23D, Q25D, G26D, A29D, A30D, K34D, A37D, 

10 T38D, Y39D, K40D, L41D, H43D, P44D, V48D, L49D, L50D, H52D, S53D, L54D, I56D, 
P57D, P60D, L61D, S62D, S63D, P65D, S66D, Q67D, A68D, L69D, Q70D, L71D, A72D, 
G73D, S76D, Q77D, L78D, S80D, F83D, Q86D, G87D, Q90D, G94D, S96D, P97D, L99D, 
G100D, P101D, T102D, T105D, Q107D, L108D, All ID, F113D, T115D, T116D, W118D, 
Q119D, Q120D, M121D, L124D, M126D, A127D, P128D, A129D, L130D, Q131D, 

15 P132D, T133D, Q134D, G135D, A136D, M137D, P138D, A139D, A141D, S142D, A143D, 
F144D, Q145D, R146D, R147D, S155D, H156D, Q158D, S159D, L161D, V163D, S164D, 
Y165D, R166D, V167D, L168D, R169D, H170D, L171D, A172D, Q173D andP174D; 
or alternatively at least one substitution selected from the group consisting of TIE, P2E, 
L3E, G4E, P5E, A6E, S7E, S8E, L9E, P10E, QUE, S12E, F13E, L14E, L15E, K16E, 

20 Q20E, V21E, R22E, K23E, Q25E, G26E, A29E, A30E, K34E, A37E, T38E, Y39E, K40E, 
L41E, H43E, P44E, V48E, L49E, L50E, H52E, S53E, L54E, I56E, P57E, P60E, L61E, 
S62E, S63E, P65E, S66E, Q67E, A68E, L69E, Q70E, L71E, A72E, G73E, S76E, Q77E, 
L78E, S80E, F83E, Q86E, G87E, Q90E, G94E, S96E, P97E, L99E, G100E, P101E, T102E, 
T105E, Q107E, L108E, All IE, F113E, T115E, T116E, W118E, Q119E, Q120E, M121E, 

25 L124E, M126E, A127E, P128E, A129E, L130E, Q131E, P132E, T133E, Q134E, G135E, 
A136E, M137E, P138E, A139E, A141E, S142E, A143E, F144E, Q145E, R146E, R147E, 
S155E, H156E, Q158E, S159E, L161E, V163E, S164E, Y165E, R166E, V167E, L168E, 
R169E, H170E, L171E, A172E, Q173E and P174E. 

Examples of preferred substitutions according to this aspect of the invention in- 

30 elude Q67E, Q70E, Q77E, Q86E, Q90E, Q120E, Q131E, Q134E, Q145E and Q173E. 

In addition to one or more of the above listed substitutions to aspartic and/or glu- 
tamic acid, a G-CSF monomeric unit may by modified in relation to hG-CSF as shown in 
SEQ ID NO:l by removal, preferably by substitution, of at least one of the amino acid resi- 
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dues selected from the group consisting of D27, D104, D109, D112, E19, E33, E45, E46, 
E93, E98, E122, E123 and E163. The substitution may be by any other amino acid residue, 
in particular by an asparagine or a glutamine residue, whereby conjugation of these residues 
to a non-polypeptide moiety can be avoided. 

5 

Conjugate of the invention wherein the non-polypeptide moiety is a carbohydrate moiety 

In a further aspect the invention relates to a conjugate comprising a glycosylated 
polypeptide in which at least one non-naturally occurring glycosylation site has been intro- 
duced into the amino acid sequence. 

10 A suitable N-glycosylation site may be introduced by introducing, preferably by 

substitution, an asparagine residue in a position occupied by an amino acid residue having 
more than 25% of its side chain exposed at the surface of the polypeptide, which position 
does not have a proline residue located in position +1 or +3 therefrom. If the amino acid 
residue located in position +2 is a serine or threonine, no further amino acid substitution is 

15 required. However, if this position is occupied by a different amino acid residue, a serine or 
threonine residue needs to be introduced. 

In a dimeric G-CSF conjugate of the invention, one or more non-naturally occur- 
ring glycosylation sites may be introduced into at least one monomelic unit relative to the 
amino acid sequence of hG-CSF by way of at least one substitution selected from the group 

20 consisting of L3N+P5S/T, P5N, A6N, S8N+P10S/T, P10N, Ql 1N+F13S/T, S 12N+L14S/T, 
F13N+L15S/T, L14N+K16S/T, K16N+L18S/T, E19N+V21S/T, Q20N+R22S/T, 
V21N+K23S/T, R22N+I24S/T, K23N+Q25S/T, Q25N+D27S/T, G26N+G28S/T, 
D27N+A29S/T, A29N+L31S/T, A30N+Q32S/T, E33N+L35S/T, A37N+Y39S/T, 
T38N+K40S/T, Y39N+L41S/T, P44N+E46S/T, E45N+L47S/T, E46N+V48S/T, 

25 V48N+L50S/T, L49N+G5 1S/T, L50N+H52S/T, H52N+L54S/T, S53N+G55S/T, P60N, 
L61N, S63N+P65S/T, P65N+Q67S/T, S66N+A68S/T, Q67N+L69S/T, A68N+Q70S/T, 
L69N+L71S/T, Q70N+A72S/T, L71N+G73S/T, G73N+L75S/T, S76N+L78S/T, 
Q77N+H79S/T, L78N, S80N+L82S/T, F83N+Y85S/T, Q86N+L88S/T, G87N+L89S/T, 
Q90N+L92S/T, E93N+I95S/T, P97N+L99S/T, L99N+P101S/T, P101N+L103S/T, 

30 T102N+D104S/T, D104N+L106S/T, T105N+Q107S/T, Q107N+D109S/T, 

L108N+V110S/T, D109N+A111S/T, A111N+F113S/T, D112N+A114S/T, F113N, 
T115N+I117S/T, T116N+W118S/T, W118N+Q120S/T, Q119N+M121S/T, 
Q120N+E122S/T, M121N+E123S/T, E122N+L124S/T, E123N+G125S/T, 
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L124N+M126S/T, M126N+P128S/T, P128N+L130S/T, L130N+P132S/T, 
P132N+Q134S/T, T133N+G135S/T, Q134N+A136S/T, A136N+P138S/T, 
P138N+F140S/T, A139N+A141S/T, A141N+A143S/T, S142N+F144S/T, 
A143N+Q145S/T, F144N+R146S/T, Q145N+R147S/T, R146N+A148S/T, 
5 R147N+G149S/T, S155N+L157S/T, H156N+Q158S/T, S159N+L161S/T, 

L161N+V163S/T, E162N, V163N+Y165S/T, S164N+R166S/T, Y165N+V167S/T, 
R166N+L168S/T, V167N+R169S/T, L168N+H170S/T, R169N+L171S/T and 
H170N+A172S/T, wherein S/T indicates an S or a T residue, preferably a T residue. 

Alternatively, the conjugate according to this aspect may comprise at least one 

10 monomelic unit comprising an amino acid sequence that differs from that shown in SEQ ID 
NO :1 in at least one substitution selected from the group consisting of P5N, A6N, PI ON, 
P60N, L61N, L78N, F113N and E162N, in particular from the group consisting of P5N, 
A6N, P10N, P60N, L61N, Fl 13N and E162N, such as from the group consisting of P60N, 
L61N, F113N and E162N. 

15 Alternatively, the conjugate according to this aspect may comprise at least one 

monomelic unit comprising an amino acid sequence that differs from that shown in SEQ ID 
NO:l in at least one substitution selected from the group consisting of D27N+A29S, 
D27N+A29T, D104N+L106S, D104N+L106T, D109N+A111S, D109N+A111T, 
Dl 12N+A1 14S and Dl 12N+A1 14T, more preferably from the group consisting of 

20 D27N+A29S, D27N+A29T, D104N+L106S, D104N+L106T, D112N+A114S and 
D112N+A114T, such as from the group consisting of D27N+A29S, D27N+A29T, 
D104N+L106S andD104N+L106T. 

Alternatively or additionally, the polypeptide may have an amino acid sequence in 
which at least one naturally occurring N-glycosylation site has been removed. 

25 Furthermore, the amino acid sequence of a polypeptide having at least one of the 

above mentioned N-glycosylation site modifications may differ from the native sequence in 
that at least one lysine residue has been removed as identified above in the section entitled 
"Removal of lysine residues". 

It will be understood that in order to prepare a conjugate according to this aspect of 

30 the invention, the polypeptide must be expressed in a glycosylating host cell capable of 
attaching oligosaccharide moieties at the glycosylation sites or alternatively subjected to in 
vitro glycosylation. Examples of glycosylating host cells are given in the section below enti- 
tled "Coupling to an oligosaccharide moiety". 
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In addition to a carbohydrate molecule, the conjugate according to the aspect of the 
invention described in the present section may contain additional non-polypeptide moieties, 
in particular a polymer molecule conjugated to one or more, optionally introduced attach- 
ment groups present in the polypeptide part of the conjugate. 

5 

Non-polypeptide moiety of the conjugate of the invention 

As indicated above, the non-polypeptide moiety of the conjugate of the invention is 
preferably selected from the group consisting of a polymer molecule, a lipophilic compound, 
an oligosaccharide moiety (e.g. by way of in vivo glycosylation) and an organic derivatizing 

10 agent. All of these moieties may confer desirable properties to the polypeptide part of the 
conjugate, in particular an increased functional in vivo half-life and/or an increased serum 
half -life. The polypeptide part of the conjugate is normally conjugated to only one type of 
non-polypeptide moiety, but it may also be conjugated to two or more different types of 
non-polypeptide moieties, e.g. to a polymer molecule and an oligosaccharide moiety, to a 

15 lipophilic group and an oligosaccharide moiety, to an organic derivatizing agent and an oli- 
gosaccharide moiety, to a lipophilic group and a polymer molecule, etc. The conjugation to 
two or more different non-polypeptide moieties may be done simultaneously or sequentially. 

Methods for preparing a conjugate of the invention 

In the following sections "Conjugation to a lipophilic compound", "Conjugation to 
20 a polymer molecule", "Conjugation to an oligosaccharide moiety" and "Conjugation to an 
organic derivatizing agent", conjugation to specific types of non-polypeptide moieties is 
described. 

Conjugation to a lipophilic compound 

25 The polypeptide and the lipophilic compound may be conjugated to each other ei- 

ther directly or by use of a linker. The lipophilic compound may be a natural compound such 
as a saturated or unsaturated fatty acid, a fatty acid diketone, a terpene, a prostaglandin, a 
vitamin, a carotenoid or steroid, or a synthetic compound such as a carbon acid, an alcohol, 
an amine or sulphonic acid with one or more alkyl, aryl, alkenyl or other multiple unsatu- 

30 rated compounds. The conjugation between the polypeptide and the lipophilic compound, 
optionally through a linker, may be done according to methods known in the art, e.g. as de- 
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scribed by Bodanszky in Peptide Synthesis, John Wiley, New York, 1976 and in WO 
96/12505. 

Conjugation to a polymer molecule 
5 The polymer molecule to be coupled to the polypeptide may be any suitable poly- 

mer molecule, such as a natural or synthetic homo-polymer or hetero-polymer, typically 
with a molecular weight in the range of 300-100,000 Da, such as 300-20,000 Da, more pref- 
erably in the range of 500-10,000 Da, such as in the range of 1000-5000 Da. Examples of 
homo-polymers include a polyol (i.e. poly-OH), a polyamine (i.e. poly-NBb) and a polycar- 

10 boxylic acid (i.e. poly-COOH). A hetero-polymer is a polymer which comprises different 
coupling groups, such as a hydroxyl group and an amine group. 

Examples of suitable polymer molecules include polymer molecules selected from 
the group consisting of polyalkylene oxide (PAO), including polyalkylene glycol (PAG), 
such as polyethylene glycol (PEG) and polypropylene glycol (PPG), branched PEGs, poly- 

15 vinyl alcohol (PVA), poly-carboxylate, poly-(vinylpyrolidone), polyethylene-co-maleic acid 
anhydride, polystyrene-co-maleic acid anhydride, dextran, including carboxymethyl- 
dextran, or any other biopolymer suitable for reducing immunogenicity and/or increasing 
functional in vivo half -life and/or serum half-life. Another example of a polymer molecule is 
human albumin or another abundant plasma protein. Generally, polyalkylene glycol-derived 

20 polymers are biocompatible, non-toxic, non-antigenic, non-immunogenic, have various wa- 
ter solubility properties, and are easily excreted from living organisms. 

PEG is the preferred polymer molecule, since it has only few reactive groups capa- 
ble of cross-linking compared to e.g. polysaccharides such as dextran. In particular, mono- 
functional PEG, e.g. methoxypolyethylene glycol (mPEG), is of interest since its coupling 

25 chemistry is relatively simple (only one reactive group is available for conjugating with 

attachment groups on the polypeptide). Consequently, the risk of cross-linking is eliminated, 
the resulting polypeptide conjugates are more homogeneous and the reaction of the polymer 
molecules with the polypeptide is easier to control. 

To effect covalent attachment of the polymer molecule(s) to the polypeptide, the 

30 hydroxyl end groups of the polymer molecule must be provided in activated form, i.e. with 
reactive functional groups. Suitable activated polymer molecules are commercially avail- 
able, e.g. from Shearwater Corporation, Huntsville, AL, USA. Alternatively, the polymer 
molecules can be activated by conventional methods known in the art, e.g. as disclosed in 
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WO 90/13540. Specific examples of activated linear or branched polymer molecules for use 
in the present invention are described in the Shearwater Corp. 2001 Catalog (Polyethylene 
Glycol and Derivatives for Biomedical Applications, incorporated herein by reference). 
Specific examples of activated PEG polymers include the following linear PEGs: NHS -PEG 
5 (e.g. SPA-PEG, SSPA-PEG, SBA-PEG, SS-PEG, SSA-PEG, SC-PEG, SG-PEG, and SCM- 
FEG), and NOR-PEG), BTC-PEG, EPOX-PEG, NCO-PEG, NPC-PEG, CDI-PEG, AED- 
PEG, TRES-PEG, VS-PEG, IODO-PEG, and MAL-PEG, and branched PEGs such as 
PEG2-NHS and those disclosed in US 5,932,462 and US 5,643,575, both of which are in- 
corporated herein by reference. Furthermore, the following publications, incorporated herein 

10 by reference, disclose useful polymer molecules and/or PEGylation chemistries: US 
5,824,778, US 5,476,653, WO 97/32607, EP 229,108, EP 402,378, US 4,902,502, US 
5,281,698, US 5,122,614, US 5,219,564, WO 92/16555, WO 94/04193, WO 94/14758, WO 
94/17039, WO 94/18247, WO 94/28024, WO 95/00162, WO 95/11924, WO95/13090, WO 
95/33490, WO 96/00080, WO 97/18832, WO 98/41562, WO 98/48837, WO 99/32134, WO 

15 99/32139, WO 99/32140, WO 96/40791, WO 98/32466, WO 95/06058, EP 439 508, WO 
97/03106, WO 96/21469, WO 95/13312, EP 921 131, US 5,736,625, WO 98/05363, EP 809 
996, US 5,629,384, WO 96/41813, WO 96/07670, US 5,473,034, US 5,516,673, EP 605 
963, US 5,382,657, EP 510 356, EP 400 472, EP 183 503 andEP 154 316. 

The conjugation of the polypeptide and the activated polymer molecules is con- 

20 ducted by use of any conventional method, e.g. as described in the following references 
(which also describe suitable methods for activation of polymer molecules): R.F. Taylor, 
(1991), "Protein immobilisation. Fundamental and applications", Marcel Dekker, N.Y.; S.S. 
Wong, (1992), "Chemistry of Protein Conjugation and Crosslinking", CRC Press, Florida, 
USA; G.T. Hermanson et al., (1993), "Immobilized Affinity Ligand Techniques", Academic 

25 Press, N. Y.). The skilled person will be aware that the activation method and/or conjugation 
chemistry to be used depends on the attachment group(s) of the polypeptide (examples of 
which are given further above), as well as the functional groups of the polymer (e.g. amine, 
hydroxyl, carboxyl, aldehyde, sulfydryl, succinimidyl, maleimide, vinysulfone or haloace- 
tate). The PEGylation may be directed towards conjugation to all available attachment 

30 groups on the polypeptide (i.e. such attachment groups that are exposed at the surface of the 
polypeptide) or may be directed towards one or more specific attachment groups, e.g. the N- 
terminal amino group (US 5,985,265). Furthermore, the conjugation may be achieved in one 
step or in a stepwise manner (e.g. as described in WO 99/55377). 
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It will be understood that the PEGylation is designed so as to produce the optimal 
molecule with respect to the number of PEG molecules attached, the size and form of such 
molecules (e.g. whether they are linear or branched), and the attachment site(s) in the poly- 
peptide. The molecular weight of the polymer to be used may e.g. be chosen on the basis of 
5 the desired effect to be achieved. For instance, if the primary purpose of the conjugation is 
to achieve a conjugate having a high molecular weight (e.g. to reduce renal clearance) it is 
usually desirable to conjugate as few high Mw polymer molecules as possible to obtain the 
desired molecular weight. When a high degree of epitope shielding is desirable this may be 
obtained by use of a sufficiently high number of low molecular weight polymer molecules 

10 (e.g. with a molecular weight of about 5000 Da) to effectively shield all or most epitopes of 
the polypeptide. For instance, 2-8, such as 3-6 such polymers may be used. 

In connection with conjugation to only a single attachment group on the protein (as 
described in US 5,985,265), it may be advantageous that the polymer molecule, which may 
be linear or branched, has a high molecular weight, e.g. about 20 kDa. 

15 Normally, the polymer conjugation is performed under conditions aimed at reacting 

most or substantially all available polymer attachment groups with polymer molecules, in 
particular by using a molar excess of the non-polypeptide moiety relative to the polypeptide. 
Typically, the molar ratio of activated polymer molecules to polypeptide is at least about 5:1 
and up to about 1000:1, e.g. in the range of from about 10:1 to about 200:1, in order to ob~ 

20 tain optimal reaction. 

It is also contemplated according to the invention to couple the polymer molecules 
to the polypeptide through a linker. Suitable linkers are well known to the skilled person. A 
preferred example is cyanuric chloride (Abuchowski et al., (1977), /. Biol Chem., 252, 
3578-3581; US 4,179,337; Shafer et al., (1986), J. Polym. Set Polym. Chem., 24, 375-378). 

25 Subsequent to the conjugation, residual activated polymer molecules are blocked 

according to methods known in the art, e.g. by addition of primary amine to the reaction 
mixture, and the resulting inactivated polymer molecules are removed by a suitable method. 

In a specific embodiment, the polypeptide conjugate of the invention is one which 
comprises a single PEG molecule attached to the N-terminal of the polypeptide and no other 

30 PEG molecules, in particular a linear or branched PEG molecule with a molecular weight of 
at least about 20 kDa. The polypeptide according to this embodiment may further comprise 
one or more oligosaccharide moieties attached to an N-linked or O-linked glycosylation site 
of the polypeptide or carbohydrate moieties attached by in vitro glycosylation. 
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In another specific embodiment, the polypeptide conjugate of the invention com- 
prises a PEG molecule attached to most or substantially all of the lysine residues in the 
polypeptide available for PEGylation, in particular a linear or branched PEG molecule, 
wherein each PEG e.g. has a molecular weight of about 5 kDa. 
5 In yet another embodiment, the polypeptide conjugate of the invention comprises a 

PEG molecule attached to most or substantially all of the lysine residues in the polypeptide 
available for PEGylation, and in addition to the N-terminal amino acid residue of the poly- 
peptide. 

Covalent in vitro coupling of carbohydrate moiety glycosides (such as dextran) to 
10 amino acid residues of the polypeptide may also be used, e.g. as described in WO 87/05330 
and in Aplin et al., CRC Crit. Rev. Biochem., pp. 259-306, 1981. The in vitro coupling of 
carbohydrate moieties or PEG to protein- and peptide-bound Gin residues can be carried out 
by transglutaminases (TGases). Transglutaminases catalyse the transfer of donor amine 
groups to protein- and peptide-bound Gin residues in a so-called cross-linking reaction. The 
15 donor- amine groups can be protein- or peptide-bound such as the e-amino group in Lys- 
residues or can be part of a small or large organic molecule. An example of a small organic 
molecule functioning as amino donor in TGase-catalysed cross-linking is putrescine (1,4- 
diaminobutane). An example of a larger organic molecule functioning as amino donor in 
TGase-catalysed cross-linking is an amine-containing PEG (Sato et al., Biochemistry 35, 
20 13072-13080). 

TGases, in general, are highly specific enzymes, and not every Gin residue exposed 
on the surface of a protein is accessible to TGase-catalysed cross-linking to amino- 
containing substances. On the contrary, only a few Gin residues function naturally as TGase 
substrates, but the exact parameters governing which Gin residues are good TGase sub- 
25 strates remain unknown. Thus, in order to render a protein susceptible to TGase-catalysed 
cross-linking reactions it is often a prerequisite at convenient positions to add stretches of 
amino acid sequence known to function well as TGase substrates. Several amino acid se- 
quences are known to be or to contain excellent natural TGase substrates e.g. substance P, 
elafin, fibrinogen, fibronectin, 0C2-plasmin inhibitor, oc-caseins, and (3-caseins. 

30 

Coupling to an oligosaccharide moiety 

The conjugation to an oligosaccharide moiety normally takes place by in vivo gly- 
cosylation effected by a glycosylating eucaryotic expression host. The expression host cell 
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may be selected from fungal (filamentous fungal or yeast), insect or animal cells or from 
transgenic plant cells. In one embodiment the host cell is a mammalian cell, such as a CHO 
cell, a BHK or a HEK cell, e.g. HEK 293, an insect cell, such as an SF9 cell, or a yeast cell, 
e.g. Saccharomyces cerevisiae or Pichia pastoris, or any of the host cells mentioned herein- 
5 after. As indicated above, glycosylation may alternatively be performed in vitro by a method 
known per se in the art. 

Coupling to an organic derivatizing agent 

Covalent modification of the polypeptide may be performed by reacting one or 

10 more attachment groups of the polypeptide with an organic derivatizing agent. Suitable deri- 
vatizing agents and methods are well known in the art. For example, cysteinyl residues most 
commonly are reacted with oc-haloacetates (and corresponding amines), such as chloroacetic 
acid or chloroacetamide, to give carboxymethyl or carboxyamidomethyl derivatives. Cys- 
teinyl residues also are derivatized by reaction with bromotrifluoroacetone, oc-bromo-(3-(4- 

15 imidozoyl)propionic acid, chloroacetyl phosphate, N-alkylmaleimides, 3-nitro-2-pyridyl 
disulfide, methyl 2-pyridyl disulfide, p-chloromercutibenzoate, 2-chloromercuri-4- 
nitrophenol, or chloro-7-nitrobenzo-2-oxa-l,3-diazole. Bttstidyl residues are derivatized by 
reaction with diethylpyrocarbonateat, pH 5.5-7.0, because this agent is relatively specific for 
the histidyl side chain. Para-bromophenacyl bromide is also useful. The reaction is prefera- 

20 bly performed in 0.1 M sodium cacodylate at pH 6.0. Lysinyl and amino terminal residues 
are reacted with succinic or other carboxylic acid anhydrides. Derivatization with these 
agents has the effect of reversing the charge of the lysinyl residues. Other suitable reagents 
for derivatizing oc-amino-containing residues include imidoesters such as methyl picolinimi- 
date, pyridoxal phosphate, pyridoxal, chloroborohydride, trinitrobenzenesulfonic acid, O- 

25 methylisourea, 2,4-pentanedione and transaminase-catalyzed reaction with glyoxylate. Ar- 
ginyl residues are modified by reaction with one or several conventional reagents, among 
them phenylglyoxal, 2,3-butanedione, 1,2-cyclohexanedione and ninhydrin. Derivatization 
of arginine residues requires that the reaction be performed in alkaline conditions because of 
the high pKa of the guanidine functional group. 

30 Furthermore, these reagents may react with the groups of lysine as well as the ar- 

ginine guanidino group. Carboxyl side groups (aspartyl or glutamyl) are selectively modi- 
fied by reaction with carbodiimides (R-N=C=N-R 5 ), where R and R' are different alkyl 
groups, such as l-cyclohexyl-3-(2-morpholinyl-4-ethyl) carbodiimide or l-ethyl-3-(4- 
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azonia-4,4-dimethylpentyl) carbodiimide. Furthermore, aspartyl and glutamyl residues are 
converted to asparaginyl and glutaminyl residues by reaction with ammonium ions. 

Conjugation of a tagged polypeptide 
5 In an alternative embodiment the polypeptide is expressed as a fusion protein with a 

tag, i.e. an amino acid sequence or peptide stretch made up of typically 1-30, such as 1-20 
amino acid residues. Besides allowing for fast and easy purification, the tag is a convenient 
tool for achieving conjugation between the tagged polypeptide and the non-polypeptide 
moiety. In particular, the tag may be used for achieving conjugation in microliter plates or 

10 other carriers, such as paramagnetic beads, to which the tagged polypeptide can be immobi- 
lised via the tag. The conjugation to the tagged polypeptide in e.g. microtiter plates has the 
advantage that the tagged polypeptide can be immobilised in the microtiter plates directly 
from the culture broth (in principle without any purification) and subjected to conjugation. 
Thereby, the total number of process steps (from expression to conjugation) can be reduced. 

15 Furthermore, the tag may function as a spacer molecule ensuring an improved accessibility 
to the immobilised polypeptide to be conjugated. The conjugation using a tagged polypep- 
tide may be to any of the non-polypeptide moieties disclosed herein, e.g. to a polymer mole- 
cule such as PEG. 

The identity of the specific tag to be used is not critical as long as the tag is capable 
20 of being expressed with the polypeptide and is capable of being immobilised on a suitable 
surface or carrier material. A number of suitable tags are commercially available, e.g. from 
Unizyme Laboratories, Denmark. For instance, the tag may consist of any of the following 
sequences (SEQ ID NOS: 9-13, respectively): 

25 His-His-His-His-His-His 

Met-Lys-His-His-His-His-His-His 
Met-Lys-His-His-Ala-His-His-Gln-His-His 
Met-Lys-ms-Gln-ffis-Gln-His-Gln-His-Gln-His-Gln-His-Gln 
Met-Lys-ffis-Gln-His-Gln-His-Gln-His-Gln-His-Gln-His-Gln-Gln 

30 

or any of (SEQ ID NOS: 14-16, respectively): 

EQKLI SEEDL (a C-terminal tag described in Mol Cell Biol 5:3610-16, 1985) 
D YKDDDDK (a C- or N-terminal tag) 
35 YPYDVPDYA 
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Antibodies against the above tags are commercially available, e.g. from Alpha Di- 
agnostic International, Inc., USA, and from Aves Lab, Inc., USA. 

A convenient method for using a tagged polypeptide for PEGylation is given in the 
Materials and Methods section below. The subsequent cleavage of the tag from the polypep- 
5 tide may be achieved by use of commercially available enzymes. 

Methods for preparing the polypeptide of the conjugate of the invention 

The polypeptide of the present invention or the polypeptide part of a conjugate of 
the invention, optionally in glycosylated form, may be produced by any suitable method 

10 known in the art. Such methods include constructing a nucleotide sequence encoding the 
polypeptide and expressing the sequence in a suitable transformed or transfected host. How- 
ever, polypeptides of the invention may be produced, albeit less efficiently, by chemical 
synthesis or a combination of chemical synthesis or a combination of chemical synthesis and 
recombinant DNA technology. 

15 The invention thus encompasses a method for preparing a single-chain multimeric 

polypeptide or polypeptide conjugate as disclosed herein, comprising culturing a recombi- 
nant host cell comprising a single nucleotide sequence encoding said polypeptide in a suit- 
able culture medium under conditions permitting expression of the nucleotide sequence, and 
recovering the resulting polypeptide from the cell culture, followed, where appropriate, by 

20 reacting the polypeptide with a polymer molecule or other non-polypeptide moiety under 
conditions permitting conjugation to take place so as to result in a polypeptide conjugate, 
and recovering the conjugate. 

The nucleotide sequence encoding a multimeric polypeptide of the invention, or the 
polypeptide part of a conjugate of the invention, may be constructed by isolating or synthe- 

25 sizing a nucleotide sequence encoding the parent polypeptide, and then changing the nucleo- 
tide sequence so as to effect introduction (i.e. insertion or substitution) or deletion (i.e. re- 
moval or substitution) of the relevant amino acid residue(s). The nucleotide sequence may 
be conveniently modified by site-directed mutagenesis in accordance with conventional 
methods. Alternatively, the nucleotide sequence may be prepared by chemical synthesis, e.g. 

30 by using an oligonucleotide synthesizer, wherein oligonucleotides are designed based on the 
amino acid sequence of the desired polypeptide, and preferably selecting those codons that 
are favored in the host cell in which the recombinant polypeptide will be produced. For ex- 
ample, several small oligonucleotides coding for portions of the desired polypeptide may be 
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synthesized and assembled by PCR, ligation or ligation chain reaction (LCR) (Barany, 
PNAS 88:189-193, 1991). The individual oligonucleotides typically contain 5' or 3' over- 
hangs for complementary assembly. 

Suitable mutations may be introduced by, e.g., site-directed mutagenesis as de- 
5 scribed by Sambrook et al. (Molecular Cloning: A Laboratory Manual, Second Edition, 
1989), or by random mutagenesis or DNA shuffling, e.g. as described below followed by 
screening for sequences coding for polypeptides with the desired activity. Screening may be 
carried out by an assay method as described below. 

Random mutagenesis (whether performed in the whole nucleotide sequence or one 
10 or more selected regions thereof) may be performed by any suitable method. For example, 
random mutagenesis is performed using a suitable physical or chemical mutagenizing agent, 
a suitable oligonucleotide, PCR generated mutagenesis or any combination of these 
mutagenizing agents/methods according to state of the art technology, e.g. as disclosed in 
WO 97/07202. 

15 Error prone PCR generated mutagenesis, e.g. as described by J.O. Deshler (1992), 

GATA 9(4): 103-106 and Leung et al., Technique (1989) Vol. 1, No. 1, pp. 11-15, is particu- 
larly useful for mutagenesis of longer peptide stretches (corresponding to nucleotide se- 
quences containing more than 100 bp) or entire genes, and is preferably performed under 
conditions that increase the misincorporation of nucleotides. 

20 Random mutagenesis based on doped or spiked oligonucleotides is of particular use 

for mutagenesis of one or more regions containing shorter nucleotide sequences (normally 
containing less than 100 nucleotides per region). Mutagenesis of several regions is conven- 
iently conducted by using several doped oligonucleotides and combining them by PCR. 
Doped or spiked oligonucleotides may also be used for random mutagenesis of nucleotide 

25 sequences encoding longer peptide stretches or entire genes when it is desirable to be able to 
control the random mutagenesis to a higher extent than what is possible with error prone 
PCR generated mutagenesis. 

Conveniently, random mutagenesis of one or more selected regions of a nucleotide 
sequence encoding the polypeptide of interest is performed using PCR generated mutagene- 

30 sis, in which one or more suitable oligonucleotide probes which flank the area to be 

mutagenized are used. Preferably, for mutagenesis of selected peptide stretches doped or 
spiked oligonucleotides is used. The doping or spiking can be designed to introduce any 
kind of amino acid residue and/or to avoid a codon for an unwanted amino acid residue (by 
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lowering the amount of or completely avoiding the nucleotides resulting in this codon). The 
doping may be designed on the basis of the skilled person's intelligent consideration of nu- 
cleotide doping (in accordance with generally known principles), by use of a suitable algo- 
rithm, e.g. a computer program which is based on the algorithm described by Siderovski DP 
5 and Mak TW, Comput. Biol Med. (1993) Vol. 23, No. 6, pp. 463-474 or Jensen et al. Nu- 
cleic Acids Research, 1998, Vol. 26, No. 3 or by using trinucleotides (Sondek, J. and 
Shortle, D., Proa Natl Acad. Sci, USA, Vol. 89, pp. 3581-3585, April 1992; Kayushin et al., 
Nucleic Acids Research, 1996, Vol. 24, No. 19, pp. 3748-3755; Virnekas et al., Nucleic Ac- 
ids Research 1994, Vol. 22, No. 25; WO 93/21203). The doped or spiked oligonucleotide 

10 can be incorporated into the nucleotide sequence encoding the polypeptide of interest by any 
published technique using e.g. PCR, LCR or any DNA polymerase or ligase. 

Random mutagenesis may be performed in two, three, four, five, six or more re- 
gions at the same time by synthesizing doped oligonucleotides covering each region and 
assembling the oligonucleotides by state of the art technologies, for example by a PCR 

15 method. One convenient PCR method involves a PCR reaction wherein the nucleotide se- 
quence encoding the polypeptide of interest is used as a template and the doped oligonucleo- 
tides are used as primers. In addition, cloning primers localized outside the targeted regions 
may be used. The resulting PCR product can either be directly cloned into an appropriate 
expression vector or gel purified and amplified in a second PCR reaction using the cloning 

20 primers and cloned into an appropriate expression vector. 

Besides substitutions the random mutagenesis may also cover random introduction 
of insertions or deletions. Preferably, the insertions are made so as to be in reading frame, 
e.g. by performing multiple introduction of three nucleotides as described by Hallet et al., 
Nucleic Acids Res. 1997, 25(9):1866-7 and Sondek and Shortle, PNAS USA 1992, 

25 89(8):3581-5. 

The nucleotide sequence(s) or nucleotide sequence region(s) to be mutagenized are 
typically present on a suitable vector such as a plasmid or a bacteriophage, which as such is 
incubated with or otherwise exposed to the mutagenizing agent. The nucleotide sequence(s) 
to be mutagenized may also be present in a host cell either by being integrated into the ge- 
30 nome of said cell or by being present on a vector harboured in the cell. Alternatively, the 
nucleotide sequence to be mutagenized is in isolated form. The nucleotide sequence is pref- 
erably a DNA sequence such as a cDNA, genomic DNA or synthetic DNA sequence. 
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In one embodiment the random mutagenesis is accompanied by conjugation to a 
non-polypeptide moiety. More specifically, a modified conjugated single-chain polypeptide 
of the invention may be prepared by 

a) expressing a random mutagenized library of nucleotide sequences encoding a 
5 parent polypeptide in single-chain form, 

b) conjugating one or more non-polypeptide moieties to the polypeptide variants 
expressed in step a), 

c) screening the resulting conjugates for agonist activity or receptor-binding, 

d) selecting polypeptide conjugates having such capability, and 

10 e) optionally subjecting the nucleotide sequence encoding the polypeptide part of 

a polypeptide conjugate selected in step d) to one or more repeated cycles of steps a)-d). 

The above method for random mutagenesis and conjugation is further described in 
PCT/DK00/00371. 

When using random mutagenesis as outlined above, the expression step a) can be 

15 conducted in any suitable manner, and conveniently as described further below. Suitably, the 
random mutagenized library is prepared by subjecting a nucleotide sequence encoding the 
parent polypeptide in single-chain form to random mutagenesis so as to create a large num- 
ber of mutated nucleotide sequences. The random mutagenesis may be entirely random, both 
with respect to where in the nucleotide sequence the mutagenesis occurs and with respect to 

20 the nature of mutagenesis. 

Alternatively, the random mutagenesis may be conducted so as to randomly mutate 
one or more selected regions of the polypeptide, in particular a receptor-binding site thereof. 
The library is typically present in a host cell, from which expression is achieved. Of particu- 
lar interest is a host cell which is capable of a reasonable transformation frequency such as 

25 bacterium, e.g. E. coli, yeast, e.g. S. cereviciae, or fungus. Alternatively, a high throughput 
transfection system of mammalian cells or other cells capable of a desirable post- 
translational modification (such as in vivo glycosylation) may be employed, for example 
using CHO (Chinese Hamster Ovary), COS or BHK (Baby Hamster Kidney) cells. 

Conjugation step b) is conveniently conducted as described above in connection 

30 with conjugation to a polymer or an oligosaccharide moiety. The screening step c) is an im- 
portant element of the method according to this embodiment of the invention. The screening 
is conveniently conducted as a primary screening for activating or receptor-binding capabil- 
ity, e.g. based on the principles disclosed in the section entitled "Assay". 
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In a preferred embodiment as many as possible of steps a-d) are performed in a 
high throughput screening system. In particular, it is preferred that steps a)-d) are performed 
in a robotized system, wherein the expression from the random mutagenized library of nu- 
cleotide sequences is achieved in microtiter plates, the resulting supernatant is transferred to 
5 a different microtiter plate, preferably under conditions allowing immobilization of the 
polypeptides, and optionally under conditions where the receptor-binding site of the poly- 
peptide is blocked, e.g. by a suitable receptor, receptor analogue or antibody, and/or the 
polypeptide is provided with a tag, e.g. of the type described above. The optionally immobi- 
lized, blocked and/or tagged polypeptides are subjected to conjugation to a non-polypeptide 
10 moiety while present in the microtiter plate and the resulting polypeptide conjugates present 
in the microtiter plate are subjected to the relevant screening. Subsequently, selected posi- 
tive polypeptide conjugates are subjected to further characterization, including secondary 
screening. 

Once assembled, the nucleotide sequence encoding the polypeptide is inserted into 
15 a recombinant vector and operably linked to control sequences necessary for expression of 
the polypeptide in the desired transformed host cell. 

In a preferred embodiment the polypeptide conjugate can be prepared in a high 
throughput screening system allowing production and screening of a high number of differ- 
ent polypeptides in a short time. This is in particular suitable in the following situations: 
20 • obtaining an improved binding affinity 

• altering receptor specificity 

• reducing/eliminating possible antagonist activity 

• identifying optimal linkers. 

Nucleotide sequence modification methods suitable for producing polypeptide vari- 
25 ants for high throughput screening further include for instance methods which involve ho- 
mologous cross-over such as disclosed in US 5,093,257, and methods which involve gene 
shuffling, i.e. recombination between two or more homologous nucleotide sequences result- 
ing in new nucleotide sequences having a number of nucleotide alterations when compared 
to the starting nucleotide sequences. Gene shuffling (also known as DNA shuffling) involves 
30 one or more cycles of random fragmentation and reassembly of the nucleotide sequences, 
followed by screening to select nucleotide sequences encoding polypeptides with desired 
properties. In order for homology-based nucleic acid shuffling to take place, the relevant 
parts of the nucleotide sequences are preferably at least 50% identical, such as at least 60% 
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identical, more preferably at least 70% identical, such as at least 80% identical. The recom- 
bination can be performed in vitro or in vivo. 

Examples of suitable in vitro gene shuffling methods are disclosed by Stemmer et 
al. (1994), Proc. Natl Acad. Set USA; vol. 91, pp. 10747-10751; Stemmer (1994), Nature, 
5 vol. 370, pp. 389-391; Smith (1994), Nature vol. 370, pp. 324-325; Zhao et al., Nat. Bio- 
technol 1998, Mar; 16(3): 258-61; Zhao H. and Arnold, KB, Nucleic Acids Research, 1997, 
Vol. 25. No. 6 pp. 1307-1308; Shao et al., Nucleic Acids Research 1998, Jan 15; 26(2): pp. 
681-83; and WO 95/17413. An example of a suitable in vivo shuffling method is disclosed 
in WO 97/07205. Other techniques for mutagenesis of nucleic acid sequences by in vitro or 

10 in vivo recombination are disclosed e.g. in WO 97/20078 and US 5,837,458. Examples of 
specific shuffling techniques include "family shuffling", "synthetic shuffling" and "in silico 
shuffling". Family shuffling involves subjecting a family of homologous genes from differ- 
ent species to one or more cycles of shuffling and subsequent screening or selection. Family 
shuffling techniques are disclosed e.g. by Crameri et al. (1998), Nature, vol. 391, pp. 288- 

15 291; Christians et al. (1999), Nature Biotechnology, vol. 17, pp. 259-264; Chang et al. 

(1999), Nature Biotechnology, vol. 17, pp. 793-797; and Ness et al. (1999), Nature Biotech- 
nology, vol. 17, 893-896. Synthetic shuffling involves providing libraries of overlapping 
synthetic oligonucleotides based e.g. on a sequence alignment of homologous genes of in- 
terest. The synthetically generated oligonucleotides are recombined, and the resulting re- 

20 combinant nucleic acid sequences are screened and if desired used for further shuffling cy- 
cles. Synthetic shuffling techniques are disclosed in WO 00/42561. In silico shuffling refers 
to a DNA shuffling procedure which is performed or modelled using a computer system, 
thereby partly or entirely avoiding the need for physically manipulating nucleic acids. Tech- 
niques for in silico shuffling are disclosed in WO 00/42560. 

25 It should of course be understood that not all vectors and expression control se- 

quences function equally well to express the nucleotide sequence encoding a polypeptide 
described herein. Neither will all hosts function equally well with the same expression sys- 
tem. However, one skilled in the art will be able to make a selection among these vectors, 
expression control sequences and hosts without undue experimentation. For example, in 

30 selecting a vector, the host must be considered because the vector must replicate in it or be 
able to integrate into the chromosome. The vector's copy number, the ability to control that 
copy number, and the expression of any other proteins encoded by the vector, such as anti- 
biotic markers, should also be considered. In selecting an expression control sequence, a 
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variety of factors should also be considered. These include, for example, the relative 
strength of the sequence, its controllability, and its compatibility with the nucleotide se- 
quence encoding the polypeptide, particularly as regards potential secondary structures. 
Hosts should be selected by consideration of their compatibility with the chosen vector, the 
5 toxicity of the product coded for by the nucleotide sequence, their secretion characteristics, 
their ability to fold the polypeptide correctly, their fermentation or culture requirements, and 
ease of purification of the products encoded by the nucleotide sequence. 

The recombinant vector may be an autonomously replicating vector, i.e. a vector 
which exists as an extrachromosomal entity, the replication of which is independent of 

10 chromosomal replication, e.g. a plasmid. Alternatively, the vector is one which, when intro- 
duced into a host cell, is integrated into the host cell genome and replicated together with the 
chromosome(s) into which it has been integrated. 

The vector is preferably an expression vector in which the nucleotide sequence en- 
coding the polypeptide of the invention is operably linked to additional segments required 

15 for transcription of the nucleotide sequence. The vector is typically derived from plasmid or 
viral DNA. A number of suitable expression vectors for expression in the host cells men- 
tioned herein are commercially available or described in the literature. Useful expression 
vectors for eukaryotic hosts include, for example, vectors comprising expression control 
sequences from SV40, bovine papilloma virus, adenovirus and cytomegalovirus. Specific 

20 vectors are, e.g., pCDNA3.1(+)\Hyg (Invitrogen, Carlsbad, CA, USA) and pCI-neo 

(Stratagene, La Jolla, CA, USA). Useful expression vectors for yeast cells include the 2\x 
plasmid and derivatives thereof, the POT1 vector (US 4,931,373), the pJS037 vector de- 
scribed in Okkels, Ann. New York Acad. Sci. 782, 202-207, 1996, and pPICZ A, B or C (In- 
vitrogen). Useful vectors for insect cells include pVL941, pBG311 (Cate et al., "Isolation of 

25 the Bovine and Human Genes for Mullerian Inhibiting Substance and Expression of the 
Human Gene in Animal Cells", Cell, 45, pp. 685-98, 1986), pBluebac 4.5 and pMelbac 
(both available from Invitrogen). Useful expression vectors for bacterial hosts include 
known bacterial plasmids, such as plasmids from E. coli, including pBR322, pET3a and 
pET12a (both from Novagen Inc., WI, USA), wider host range plasmids, such as RP4, 

30 phage DNAs, e.g. the numerous derivatives of phage lambda, e.g. NM989, and other DNA 
phages, such as M13 and filamentous single stranded DNA phages. 

Other vectors for use in this invention include those that allow the nucleotide se- 
quence encoding the polypeptide to be amplified in copy number. Such amplifiable vectors 
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axe well known in the art. They include, for example, vectors able to be amplified by DHFR 
amplification (see, e.g., Kaufman, US 4,470,461, Kaufman and Sharp, "Construction Of A 
Modular Dihydrafolate Reductase cDNA Gene: Analysis Of Signals Utilized For Efficient 
Expression", Mol Cell Biol, 2, pp. 1304-19 (1982)) and glutamine synthetase ("GS") am- 
5 plification (see e.g. US 5,122,464 and EP 338,841). 

The recombinant vector may further comprise a DNA sequence enabling the vector 
to replicate in the host cell in question. An example of such a sequence (when the host cell is 
a mammalian cell) is the SV40 origin of replication. When the host cell is a yeast cell, suit- 
able sequences enabling the vector to replicate are the yeast plasmid 2jll replication genes 

10 REP 1-3 and origin of replication. 

The vector may also comprise a selectable marker, e.g. a gene whose product com- 
plements a defect in the host cell, such as the gene coding for dihydrofolate reductase 
(DHFR) or the Schizosaccharomyces pombe TPI gene (described by P.R. Russell, Gene 40, 
1985, pp. 125-130), or one which confers resistance to a drug, e.g. ampicillin, kanamycin, 

15 tetracyclin, chloramphenicol, neomycin, hygromycin or methotrexate. For S. cerevisiae, 
selectable markers include ura3 and leu2. For filamentous fungi, selectable markers include 
amdS, pyrG, arcB, niaD and sC. 

The term "control sequences" is defined herein to include all components that are 
necessary or advantageous for the expression of the polypeptide of the invention. Each con- 

20 trol sequence may be native or foreign to the nucleic acid sequence encoding the polypep- 
tide. Such control sequences include, but are not limited to, a leader sequence, polyadenyla- 
tion sequence, propeptide sequence, promoter, enhancer or upstream activating sequence, 
signal peptide sequence, and transcription terminator. At a minimum, the control sequences 
include a promoter. 

25 A wide variety of expression control sequences may be used in the present inven- 

tion. Such useful expression control sequences include the expression control sequences 
associated with structural genes of the foregoing expression vectors as well as any sequence 
known to control the expression of genes of prokaryotic or eukaryotic cells or their viruses, 
and various combinations thereof. 

30 Examples of suitable control sequences for directing transcription in mammalian 

cells include the early and late promoters of SV40 and adenovirus, e.g. the adenovirus 2 
major late promoter, the MT-1 (metallothionein gene) promoter, the human cytomegalovirus 
immediate-early gene promoter (CMV), the human elongation factor la (EF-la) promoter, 
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the Drosophila minimal heat shock protein 70 promoter, the Rous Sarcoma Virus (RSV) 
promoter, the human ubiquitin C (UbC) promoter, the human growth hormone terminator, 
S V40 or adenovirus Elb region polyadenylation signals and the Kozak consensus sequence 
(Kozak, M. JMol Biol 1987 Aug 20;196(4):947-50). 
5 In order to improve expression in mammalian cells a synthetic intron may be in- 

serted in the 5' untranslated region of the nucleotide sequence encoding the polypeptide. An 
example of a synthetic intron is the synthetic intron from the plasmid pCI-Neo (available 
from Promega Corporation, WI, USA). 

Examples of suitable control sequences for directing transcription in insect cells in- 

10 elude the polyhedrin promoter, the P10 promoter, the Autographa califomica polyhedrosis 
virus basic protein promoter, the baculovirus immediate early gene 1 promoter, the bacu- 
lovirus 39K delayed-early gene promoter, and the S V40 polyadenylation sequence. Exam- 
ples of suitable control sequences for use in yeast host cells include the promoters of the 
yeast ot-mating system, the yeast triose phosphate isomerase (TPI) promoter, promoters 

15 from yeast glycolytic genes or alcohol dehydrogenase genes, the ADH2-4c promoter, and 
the inducible GAL promoter. Examples of suitable control sequences for use in filamentous 
fungal host cells include the ADH3 promoter and terminator, a promoter derived from the 
genes encoding Aspergillus oryzae TAKA amylase triose phosphate isomerase or alkaline 
protease, an A. niger a-amylase, A. niger or A. nidulans glucoamylase, A. nidulans acetami- 

20 dase, Rhizomucor miehei aspartic proteinase or lipase, the TPI1 terminator and the ADH3 
terminator. Examples of suitable control sequences for use in bacterial host cells include 
promoters of the lac system, the trp system, the TAC or TRC system, and the major promoter 
regions of phage lambda. 

The presence or absence of a signal peptide will e.g. depend on the expression host 

25 cell used for the production of the polypeptide to be expressed (whether it is an intracellular 
or extracellular polypeptide) and whether it is desirable to obtain secretion. For use in fila- 
mentous fungi, the signal peptide may conveniently be derived from a gene encoding an 
Aspergillus sp. amylase or glucoamylase, a gene encoding a Rhizomucor miehei lipase or 
protease or a Humicola lanuginosa lipase. The signal peptide is preferably derived from a 

30 gene encoding A. oryzae TAKA amylase, A. niger neutral a-amylase, A. niger acid-stable 
amylase, or A. niger glucoamylase. For use in insect cells, the signal peptide may conven- 
iently be derived from an insect gene (cf. WO 90/05783), such as the Lepidopteran manduca 
sexta adipokinetic hormone precursor, (cf. US 5,023,328), the honeybee melittin (Invitro- 
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gen), ecdysteroidUDPglucosyltransferase (egt) (Murphy et al., Protein Expression and Pu- 
rification 4, 349-357 (1993) or human pancreatic lipase (hpl) {Methods in Enzymology 284, 
pp. 262-272, 1997). A preferred signal peptide for use in mammalian cells is e.g. the murine 
Ig kappa light chain signal peptide (Coloma, M (1992) J. Imm. Methods 152:89-104). For 
5 use in yeast cells, suitable signal peptides have been found to be the a-f actor signal peptide 
from S. cereviciae (cf. US 4,870,008), a modified carboxypeptidase signal peptide (cf. L.A. 
Vails et al., Cell 48, 1987, pp. 887-897), the yeast BAR1 signal peptide (cf. WO 87/02670), 
the yeast aspartic protease 3 (YAP3) signal peptide (cf. M. Egel-Mitani et al., Yeast 6, 1990, 
pp. 127-137), and the synthetic leader sequence TA57 (W098/32867). For use in E. coli 

10 cells a suitable signal peptide has been found to be the signal peptide ompA (EP 581 821). 

The nucleotide sequence of the invention encoding a polypeptide, whether prepared 
by site-directed mutagenesis, synthesis, PGR or other methods, may optionally also include 
a nucleotide sequence that encodes a signal peptide. The signal peptide is present when the 
polypeptide is to be secreted from the cells in which it is expressed. Such a signal peptide, if 

15 present, should be one recognized by the cell chosen for expression of the polypeptide. The 
signal peptide may be homologous (e.g. be that normally associated with the native peptide) 
or heterologous (i.e. originating from another source) to the polypeptide or may be homolo- 
gous or heterologous to the host cell, i.e. be a signal peptide normally expressed from the 
host cell or one which is not normally expressed from the host cell. Accordingly, the signal 

20 peptide may be prokaryotic, e.g. derived from a bacterium such as E. coli, or eukaryotic, e.g. 
derived from a mammalian, insect or yeast cell. 

Any suitable host may be used to produce the polypeptide or polypeptide part of the 
conjugate of the invention, including bacteria, fungi (including yeasts), plants, insects, 
mammals or other animals, or an appropriate animal cell line or another cell line. Examples 

25 of bacterial host cells include gram-positive bacteria such as strains of Bacillus, e.g. B. bre~ 
vis or B. subtilis, Pseudomonas or Streptomyces, or gram-negative bacteria such as strains of 
E. coli. The introduction of a vector into a bacterial host cell may, for instance, be effected 
by protoplast transformation (see e.g. Chang and Cohen, 1979, Molecular General Genetics 
168: 111-115), using competent cells (see e.g. Young and Spizizin, 1961, Journal ofBacte- 

30 riology 81: 823-829, or Dubnau and Davidoff-Abelson, 1971, Journal of Molecular Biology 
56: 209-221), electroporation (see e.g. Shigekawa and Dower, 1988, Biotechniques 6: 742- 
751), or conjugation (see e.g. Koehler and Thorne, 1987, Journal of Bacteriology 169: 5771- 
5278). 
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Examples of suitable filamentous fungal host cells include strains of Aspergillus, 
e.g. A. oryzae, A. niger or A. nidulans, Fusarium or Trichoderma. Fungal cells may be trans- 
formed by a process involving protoplast formation, transformation of the protoplasts, and 
regeneration of the cell wall in a manner known per se. Suitable procedures for transforma- 
5 tion of Aspergillus host cells are described in EP 238 023 and US 5,679,543. Suitable meth- 
ods for transforming Fusarium species are described by Malardier et al., 1989, Gene 78: 
147-156 and WO 96/00787. Examples of suitable yeast host cells include strains of Sac- 
charomyces, e.g. S. cerevisiae, Schizosaccharomyces, Klyveromyces, Pichia, such as P. pas- 
toris or P. methanolica, Hansenula, such as H. polymorpha, or Yarrowia, Yeast may be 

10 transformed using the procedures described by Becker and Guarente, In Abelson, J.N. and 
Simon, M.I., editors, Guide to Yeast Genetics and Molecular Biology , Methods in Enzymol- 
ogy, Volume 194, pp. 182-187, Academic Press, Inc., New York; Ito et al., 1983, Journal of 
Bacteriology 153: 163; Hinnen et al., 1978, PNAS USA 75: 1920: and as disclosed by Clon- 
tech Laboratories, Inc., Palo Alto, CA, USA (in the product protocol for the Yeastmaker™ 

15 Yeast Transformation System Kit). 

Examples of suitable insect host cells include a Lepidoptora cell line, such as Spo- 
dopterafrugiperda (Sf9 or Sf21) or Trichoplusioa ni cells (High Five) (US 5,077,214). 
Transformation of insect cells and production of heterologous polypeptides therein may be 
performed as described by Invitrogen. Examples of suitable mammalian host cells include 

20 Chinese hamster ovary (CHO) cell lines, (e.g. CHO-K1; ATCC CCL-61), Green Monkey 
cell lines (COS) (e.g. COS 1 (ATCC CRL-1650), COS 7 (ATCC CRL-1651)); mouse cells 
(e.g. NS/O), Baby Hamster Kidney (BHK) cell lines (e.g. ATCC CRL-1632 or ATCC CCL- 
10), and human cells (e.g. HEK 293 (ATCC CRL-1573)), as well as plant cells in tissue cul- 
ture. 

25 Additional suitable cell lines are known in the art and available from public 

depositories such as the American Type Culture Collection, Rockville, Maryland, USA. 
Methods for introducing exogeneous DNA into mammalian host cells include calcium 
phosphate-mediated transfection, electroporation, DEAE-dextran mediated transfection, 
liposome-mediated transfection, viral vectors and the transfection method described by Life 

30 Technologies Ltd, Paisley, UK using Lipofectamin 2000. These methods are well known in 
the art and e.g. described by Ausbel et al. (eds.), 1996, Current Protocols in Molecular 
Biology, John Wiley & Sons, New York, USA. The cultivation of mammalian cells is 
conducted according to established methods, e.g. as disclosed in: Animal Cell 
Biotechnology, Methods and Protocols, Edited by Nigel Jenkins, 1999, Human Press Inc., 
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and Protocols, Edited by Nigel Jenkins, 1999, Human Press Inc., Totowa, NJ, USA, and 
Harrison MA and Rae IF, General Techniques of Cell Culture, Cambridge University Press, 
1997. 

In the production methods of the present invention, the cells are cultivated in a nu- 
5 trient medium suitable for production of the polypeptide using methods known in the art. 
For example, the cell may be cultivated by shake flask cultivation, small-scale or large-scale 
fermentation (including continuous, batch, fed-batch, or solid state fermentations) in labora- 
tory or industrial fermentors performed in a suitable medium and under conditions allowing 
the polypeptide to be expressed and/or isolated. The cultivation takes place in a suitable nu- 

10 trient medium comprising carbon and nitrogen sources and inorganic salts, using procedures 
known in the art. Suitable media are available from commercial suppliers or may be pre- 
pared according to published compositions (e.g. in catalogues of the American Type Culture 
Collection). If the polypeptide is secreted into the nutrient medium, the polypeptide can be 
recovered directly from the medium. If the polypeptide is not secreted, it can be recovered 

15 from cell lysates. 

The resulting polypeptide may be recovered by methods known in the art. For ex- 
ample, the polypeptide may be recovered from the nutrient medium by conventional proce- 
dures including, but not limited to, centrifugation, filtration, extraction, spray drying, evapo- 
ration or precipitation. 

20 The polypeptides may be purified by a variety of procedures known in the art in- 

cluding, but not limited to, chromatography (e.g. ion exchange, affinity, hydrophobic, chro- 
matofocusing, and size exclusion), electrophoretic procedures (e.g. preparative isoelectric 
focusing), differential solubility (e.g. ammonium sulfate precipitation), SDS-PAGE, or ex- 
traction (see e.g. Protein Purification, J.-C. Janson and Lars Ryden, editors, VCH Publish- 

25 ers, New York, 1989). Specific methods for purifying cytokine polypeptides are described in 
Human Cytokines, Handbook of Basic and Clinical Research, Volume n, Blackwell Sci- 
ence, Eds. Aggarwal and Gutterman, 1996, pp. 19-42. 

Pharmaceutical use and formulations 

The multimeric polypeptides and conjugates of the invention may be used for the 
30 manufacture of a medicament for treatment of diseases in mammals, in particular humans. 
The exact dose of a particular polypeptide or conjugate to be administered will depend on 
e.g. the disease, the administration schedule, whether it is administered alone or in conjunc- 



WO 02/36626 



PCT/DK01/00724 



56 

tion with other therapeutic agents, the serum half-life of the compositions, and the general 
health of the patient. The polypeptide or conjugate will advantageously be administered in 
an "effective amount", meaning an amount that prevents or alleviates, to any degree, or 
eliminates, the condition being treated. 
5 Pharmaceutical formulations of the polypeptide or conjugate of the invention are 

typically administered in a composition that includes one or more pharmaceutically accept- 
able carriers or excipients. Such pharmaceutical compositions may be prepared in a manner 
known per se in the art to result in a polypeptide pharmaceutical that is sufficiently storage- 
stable and is suitable for administration to humans or animals. 

10 

Drug form 

The polypeptide or conjugate of the invention can be used "as is" and/or in a salt 
form thereof. Suitable salts include, but are not limited to, salts with alkali metals or alkaline 
earth metals, such as sodium, potassium, calcium and magnesium, as well as e.g. zinc salts. 
15 These salts or complexes may by present as a crystalline and/or amorphous structure. 

Excipients 

"Pharmaceutically acceptable" means a carrier or excipient that at the dosages and 
concentrations employed does not cause any untoward effects in the patients to whom it is 
20 administered. Such pharmaceutically acceptable carriers and excipients are well known in 
the art (see Remington's Pharmaceutical Sciences, 18th edition, A. R. Gennaro, Ed., Mack 
Publishing Company [1990]; Pharmaceutical Formulation Development of Peptides and 
Proteins, S. Frokjaer and L. Hovgaard, Eds., Taylor & Francis [2000] ; and Handbook of 
Pharmaceutical Excipients, 3rd edition, A. Kibbe, Ed., Pharmaceutical Press [2000]). 

25 

Dose 

The polypeptides and conjugates of the invention will be administered to patients in 
a therapeutically effective dose. By "therapeutically effective dose" herein is meant a dose 
that is sufficient to produced the desired effects in relation to the condition for which it is 
30 administered. The exact dose will depend on the disorder to be treated, and will be ascer- 
tainable by one skilled in the art using known techniques. 
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Mix of drugs 

The pharmaceutical composition of the invention may be administered alone or in 
conjunction with other therapeutic agents. These agents may be incorporated as part of the 
same pharmaceutical composition or may be administered separately from the polypeptide 
5 or conjugate of the invention, either concurrently or in accordance with another treatment 
schedule. In addition, the polypeptide, conjugate or pharmaceutical composition of the in- 
vention may be used as an adjuvant to other therapies. 

Patients 

10 A "patient" for the purposes of the present invention includes both humans and 

other mammals. Thus the methods are applicable to both human therapy and veterinary ap- 
plications. 

Types of composition and administration route 

15 The pharmaceutical composition of the polypeptide or conjugate of the invention 

may be formulated in a variety of forms, e.g. as a liquid, gel, lyophilized, or as a compressed 
solid. The preferred form will depend upon the particular indication being treated and will 
be readily able to be determined by one skilled in the art. 

The administration of the formulations of the present invention can be performed in 

20 a variety of ways, including, but not limited to, subcutaneously, intravenously, orally, in- 
tracerebrally, intranasally, transdermally, intraperitoneally, intramuscularly, intrapulmonary, 
vaginally, rectally, intraocularly, or in any other acceptable manner. The formulations can be 
administered continuously by infusion, although bolus injection is acceptable, using tech- 
niques well known in the art, such as pumps or implantation. 

25 

Parenterals 

An example of a pharmaceutical composition is a solution designed for parenteral 
administration. Although in many cases pharmaceutical solution formulations are provided 
in liquid form, appropriate for immediate use, such parenteral formulations may also be pro- 
30 vided in frozen or in lyophilized form. The latter form is often used to enhance the stability 
of the active compound contained in the composition under a wider variety of storage condi- 
tions, as it is recognized by those skilled in the art that lyophilized preparations are generally 
more stable than their liquid counterparts. Such lyophilized preparations are reconstituted 
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prior to use by the addition of one or more suitable pharmaceutical^ acceptable diluents 
such as sterile water for injection or sterile physiological saline solution. 

In case of parenterals, they are prepared for storage as lyophilized formulations or 
aqueous solutions by mixing, as appropriate, the polypeptide having the desired degree of 
5 purity with one or more pharmaceutical^ acceptable carriers, excipients or stabilizers typi- 
cally employed in the art (all of which are termed "excipients"), for example buffering 
agents, stabilizing agents, preservatives, isotonifiers, non-ionic detergents, antioxidants 
and/or other miscellaneous additives. 

Buffering agents help to maintain the pH in the range which approximates physio- 

10 logical conditions. They are typically present at a concentration ranging from about 2 mM to 
about 50 mM Suitable buffering agents for use with the present invention include both or- 
ganic and inorganic acids and salts thereof such as citrate buffers (e.g., monosodium citrate- 
disodium citrate mixture, citric acid-trisodium citrate mixture, citric acid-monosodium cit- 
rate mixture, etc.), succinate buffers (e.g., succinic acid-monosodium succinate mixture, 

15 succinic acid-sodium hydroxide mixture, succinic acid-disodium succinate mixture, etc.), 
tartrate buffers (e.g., tartaric acid-sodium tartrate mixture, tartaric acid-potassium tartrate 
mixture, tartaric acid-sodium hydroxide mixture, etc.), fumarate buffers (e.g., fumaric acid- 
monosodium fumarate mixture, fumaric acid-disodium fumarate mixture, monosodium fu- 
marate-disodium fumarate mixture, etc.), gluconate buffers (e.g., gluconic acid-sodium gly- 

20 conate mixture, gluconic acid-sodium hydroxide mixture, gluconic acid-potassium glyucon- 
ate mixture, etc.), oxalate buffers (e.g., oxalic acid-sodium oxalate mixture, oxalic acid- 
sodium hydroxide mixture, oxalic acid-potassium oxalate mixture, etc.), lactate buffers (e.g., 
lactic acid-sodium lactate mixture, lactic acid-sodium hydroxide mixture, lactic acid- 
potassium lactate mixture, etc.) and acetate buffers (e.g., acetic acid-sodium acetate mixture, 

25 acetic acid-sodium hydroxide mixture, etc.). Additional possibilities are phosphate buffers, 
histidine buffers and trimethylamine salts such as Tris. 

Preservatives are added to retard microbial growth, and are typically added in 
amounts of e.g. about 0.1%~2% (w/v). Suitable preservatives for use with the present inven- 
tion include phenol, benzyl alcohol, meta-cresol, methyl paraben, propyl paraben, octade- 

30 cyldimethylbenzyl ammonium chloride, benzalkonium halides (e.g. benzalkonium chloride, 
bromide or iodide), hexamethonium chloride, alkyl parabens such as methyl or propyl para- 
ben, catechol, resorcinol, cyclohexanol and 3-pentanol. 
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Isotonicifiers are added to ensure isotonicity of liquid compositions and include 
polyhydric sugar alcohols, preferably trihydric or higher sugar alcohols, such as glycerin, 
erythritol, arabitol, xylitol, sorbitol and mannitol. Polyhydric alcohols can be present in an 
amount between 0.1% and 25% by weight, typically 1% to 5%, taking into account the rela- 
5 tive amounts of the other ingredients. 

Stabilizers refer to a broad category of excipients which can range in function from 
a bulking agent to an additive which solubilizes the therapeutic agent or helps to prevent 
denaturation or adherence to the container wall. Typical stabilizers can be polyhydric sugar 
alcohols (enumerated above); amino acids such as arginine, lysine, glycine, glutamine, as- 

10 paragine, histidine, alanine, ornithine, L-leucine, 2-phenylalanine, glutamic acid, threonine, 
etc., organic sugars or sugar alcohols, such as lactose, trehalose, stachyose, mannitol, sorbi- 
tol, xylitol, ribitol, myoinisitol, galactitol, glycerol and the like, including cyclitols such as 
inositol; polyethylene glycol; amino acid polymers; sulfur-containing reducing agents, such 
as urea, glutathione, thioctic acid, sodium thioglycolate, thioglycerol, oc-monothioglycerol 

15 and sodium thiosulfate; low molecular weight polypeptides (i.e. <10 residues); proteins such 
as human serum albumin, bovine serum albumin, gelatin or immunoglobulins; hydrophilic 
polymers such as polyvinylpyrrolidone; monosaccharides such as xylose, mannose, fructose 
and glucose; disaccharides such as lactose, maltose and sucrose; trisaccharides such as raffi- 
nose, and polysaccharides such as dextran. Stabilizers are typically present in the range of 

20 from 0.1 to 10,000 parts by weight based on the active protein weight. 

Non-ionic surfactants or detergents (also known as "wetting agents") may be pre- 
sent to help solubilize the therapeutic agent as well as to protect the therapeutic polypeptide 
against agitation-induced aggregation, which also permits the formulation to be exposed to 
shear surface stress without causing denaturation of the polypeptide. Suitable non-ionic sur- 

25 factants include polysorbates (20, 80, etc.), polyoxamers (184, 188 etc.), Pluronic® polyols, 
polyoxyethylene sorbitan monoethers (Tween®-20, Tween®-80, etc.). 

Additional miscellaneous excipients include bulking agents or fillers (e.g. starch), 
chelating agents (e.g. EDTA), antioxidants (e.g., ascorbic acid, methionine, vitamin E) and 
cosol vents. 

30 The active ingredient may also be entrapped in microcapsules prepared, for exam- 

ple, by coascervation techniques or by interfacial polymerization, for example hydroxy- 
methylcellulose, gelatin or poly-(methylmethacylate) microcapsules, in colloidal drug deliv- 
ery systems (for example liposomes, albumin microspheres, microemulsions, nano-particles 
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and nanocapsules) or in macroemulsions. Such techniques are disclosed in Remington's 
Pharmaceutical Sciences, supra. 

Parenteral formulations to be used for in vivo administration must be sterile. This is 
readily accomplished, for example, by filtration through sterile filtration membranes. 

5 

Sustained release preparations 

Suitable examples of sustained-release preparations include semi-permeable matri- 
ces of solid hydrophobic polymers containing the polypeptide or conjugate, the matrices 
having a suitable form such as a film or microcapsules. Examples of sustained-release ma- 

10 trices include polyesters, hydrogels (for example, poly(2-hydroxyethyl-methacrylate) or 
poly(vinylalcohol)), polylactides, copolymers of L-glutamic acid and ethyl-L-glutamate, 
non-degradable ethylene- vinyl acetate, degradable lactic acid-glycolic acid copolymers such 
as the ProLease® technology or Lupron Depot® (injectable microspheres composed of lac- 
tic acid-glycolic acid copolymer and leuprolide acetate), and poly-D-(-)-3-hydroxybutyric 

15 acid. While polymers such as ethylene- vinyl acetate and lactic acid-glycolic acid enable re- 
lease of molecules for long periods such as up to or over 100 days, certain hydrogels release 
proteins for shorter time periods. When encapsulated polypeptides remain in the body for a 
long time, they may denature or aggregate as a result of exposure to moisture at 37°C, 
resulting in a loss of biological activity and possible changes in immunogenicity. Rational 

20 strategies can be devised for stabilization depending on the mechanism involved. For exam- 
ple, if the aggregation mechanism is discovered to be intermolecular S-S bond formation 
through thio-disulfide interchange, stabilization may be achieved by modifying sulfhydryl 
residues, lyophilizing from acidic solutions, controlling moisture content, using appropriate 
additives, and developing specific polymer matrix compositions. 

25 

Pulmonary delivery 

Formulations suitable for use with a nebulizer, either jet or ultrasonic, will typically 
comprise the polypeptide or conjugate dissolved in water at a concentration of, e.g., about 
0.01 to 25 mg of conjugate per mL of solution, preferably about 0.1 to 10 mg/mL. The for- 
30 mulation may also include a buffer and a simple sugar (e.g., for protein stabilization and 
regulation of osmotic pressure), and/or human serum albumin ranging in concentration from 
0.1 to 10 mg/ml. Examples of buffers that may be used are sodium acetate, citrate and gly- 
cine. Preferably, the buffer will have a composition and molarity suitable to adjust the solu- 
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tion to a pH in the range of 3 to 9. Generally, buffer molarities of from 1 mM to 50 mM are 
suitable for this purpose. Examples of sugars which can be utilized are lactose, maltose, 
mannitol, sorbitol, trehalose, and xylose, usually in amounts ranging from 1% to 10% by 
weight of the formulation. 
5 The nebulizer formulation may also contain a surfactant to reduce or prevent sur- 

face induced aggregation of the protein caused by atomization of the solution in forming the 
aerosol. Various conventional surfactants can be employed, such as polyoxyethylene fatty 
acid esters and alcohols, and polyoxyethylene sorbitan fatty acid esters. Amounts will gen- 
erally range between 0.001% and 4% by weight of the formulation. An especially preferred 

10 surfactant for purposes of this invention is polyoxyethylene sorbitan monooleate. 

Specific formulations and methods of generating suitable dispersions of liquid par- 
ticles of the invention are described in WO 94/20069, US 5,915,378, US 5,960,792, US 
5,957,124, US 5,934,272, US 5,915,378, US 5,855,564, US 5,826,570 and US 5,522,385, 
which are hereby incorporated by reference. 

15 Formulations for use with a metered dose inhaler device will generally comprise a 

finely divided powder. This powder may be produced by lyophilizing and then milling a 
liquid conjugate formulation and may also contain a stabilizer such as human serum albumin 
(HSA). Typically, more than 0.5% (w/w) HSA is added. Additionally, one or more sugars or 
sugar alcohols may be added to the preparation if necessary. Examples include lactose mal- 

20 tose, mannitol, sorbitol, sorbitose, trehalose, xylitol, and xylose. The amount added to the 
formulation can range from about 0.01 to 200% (w/w), preferably from approximately 1 to 
50%, of the conjugate present. Such formulations are then lyophilized and milled to the de- 
sired particle size. 

The properly sized particles are then suspended in a propellant with the aid of a sur- 
25 factant. The propellant may be any conventional material employed for this purpose, such as 
a chlorofluorocarbon, a hydrochlorofluorocarbon, a hydrofluorocarbon, or a hydrocarbon, 
including trichlorofluoromethane, dichlorodifluoromethane, dichlorotetrafluoroethanol, and 
1,1,1,2-tetrafluoroethane, or combinations thereof. Suitable surfactants include sorbitan tri- 
oleate and soya lecithin. Oleic acid may also be useful as a surfactant. This mixture is then 
30 loaded into the delivery device. 

Formulations for powder inhalers will comprise a finely divided dry powder con- 
taining conjugate and may also include a bulking agent, such as lactose, sorbitol, sucrose, or 
mannitol in amounts which facilitate dispersal of the powder from the device, e.g., 50% to 



WO 02/36626 



PCT/DK01/00724 



62 

90% by weight of the formulation. The particles of the powder shall have aerodynamic 
properties in the lung corresponding to particles with a density of about 1 g/cm 2 having a 
median diameter less than 10 micrometers, preferably between 0.5 and 5 micrometers, most 
preferably of between 1.5 and 3.5 micrometers. 
5 The powders for these devices may be generated and/or delivered by methods dis- 

closed in US 5,997,848, US 5,993,783, US 5,985,248, US 5,976,574, US 5,922,354, US 
5,785,049 and US 5,654,007. 

Mechanical devices designed for pulmonary delivery of therapeutic products, in- 
clude but are not limited to nebulizers, metered dose inhalers, and powder inhalers, all of 

10 which are familiar to those of skill in the art. Specific examples of commercially available 
devices suitable for the practice of this invention are the Ultravent™ nebulizer, manufac- 
tured by Mallinckrodt, Inc., St. Louis, Missouri; the Acorn™ II nebulizer, manufactured by 
Marquest Medical Products, Englewood, Colorado; the Ventolin™ metered dose inhaler, 
manufactured by Glaxo Inc., Research Triangle Park, North Carolina; the Spinhaler™ pow- 

15 der inhaler, manufactured by Fisons Corp., Bedford, Massachusetts; the "standing cloud" 
device of Inhale Therapeutic Systems, Inc., San Carlos, California; the AIR™ inhaler manu- 
factured by Alkermes, Cambridge, Massachusetts; and the AERxr™ pulmonary drug deliv- 
ery system manufactured by Aradigm Corporation, Hayward, California. 



20 EXAMPLES 

MATERIALS AND METHODS 



Methods used to determine amino acids to be modified 

Accessible Surface Area (ASA) 

The computer program Access (B. Lee and F.M.Richards, J. Mol.Biol. 55: 379-400 
25 (1971)) version 2 (©1983 Yale University) are used to compute the accessible surface area 
(ASA) of the individual atoms in the structure. This method typically uses a probe-size of 
1.4A and defines the Accessible Surface Area (ASA) as the area formed by the centre of the 
probe. Prior to this calculation all water molecules and all hydrogen atoms should be re- 
moved from the coordinate set, as should other atoms not directly related to the protein. 



30 
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Fractional AS A of side chain 

The fractional ASA of the side chain atoms is computed by division of the sum of 
the ASA of the atoms in the side chain by a value representing the ASA of the side chain 
atoms of that residue type in an extended ALA-x-ALA tripeptide. See Hubbard, Campbell & 
5 Thornton (1991) J.Mol.BioL 220, 507-530. For this example the CA atom is regarded as a 
part of the side chain of glycine residues but not for the remaining residues. The following 
values are used as standard 100% ASA for the side chain: 
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Residues not detected in the structure are defined as having 100% exposure as they 
10 are thought to reside in flexible regions. 

Determining distances between atoms 

The distance between atoms is most easily determined using molecular graphics 
software, e.g. Insightn v. 98.0, MSI Inc. 

15 

General considerations regarding amino acid residues to be modified 

As explained above, amino acid residues to be modified in accordance with the pre- 
sent invention are preferably those whose side chains are surface exposed, in particular those 
with more than about 25% of the side chain exposed at the surface of the molecule, and 

20 more preferably those with more than 50% side chain exposure. Another consideration is 
that residues located in receptor interfaces are preferably excluded so as to avoid or at least 
minimize possible interference with receptor binding or activation. A further consideration 
is that residues that are less than 10A from the nearest Lys (Glu, Asp) CB-CB (CA for Gly) 
should also be excluded. Finally, preferred positions for modification are in particular those 

25 that have a hydrophilic and/or charged residue, i.e. Asp, Asn, Glu, Gin, Arg, His, Tyr, Ser 
and Thr, positions that have an arginine residue being especially preferred. 
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Identifying G-CSF amino acid residues for modification 

Taking G-CSF as an example, the information below illustrates the factors that 
generally should be taken into consideration when identifying amino acid residues to be 
5 modified in accordance with the present invention. Based on these considerations with re- 
spect to side chain exposure, residues located within and outside of the receptor interface, 
distance between atoms, and whether residues are charged and/or hydrophilic, it is possible 
to select amino acid residues that are suitable for modification in a given polypeptide to ob- 
tain a desired result. 

10 Three-dimensional structures have been reported for human G-CSF by X-ray crys- 

tallography and by NMR spectroscopy, respectively (Proc. Natl Acad. Set USA 90:5167- 
5171, 1993; Biochemistry 33:8453-8463, 1994). The X-ray structure of a complex between 
G-CSF and the BN-BC domains of the GCSFR receptor have been reported in Nature 
401:713-717, 1999. Also, a 3D ensemble of 10 structures determined by NMR spectroscopy 

15 (Proc. Natl. Acad. Sci. USA 90:5167-5171, 1993) is available from the Protein Data Bank 
(PDB) (Bernstein et al., /. Mol Biol (1977) 112, pp. 535). 

Aritomi et al., Nature 401:713-717, 1999 have described the X-ray structure of a 
complex between hG-CSF and the BN-BC domains of the G-CSF receptor. They identify 
the following hG-CSF residues as being part of the receptor binding interfaces: G4, P5, A6, 

20 S7, S8, L9, P10, Qll, S12, L15, K16, E19, Q20, L108, D109, D112, T115, T116, Q119, 
E122, E123, and L124. Thus, although it is possible to modify these residues, it is preferred 
that these residues are excluded from modification. 

Using the 10 NMR structures of G-CSF identified above as input structures fol- 
lowed by a computation of the average ASA of the side chain, the following residues have 

25 been identified as having more than 25% ASA: M0, Tl, P2, L3, G4, P5, A6, S7, S8, L9, 
P10, Qll, S12, F13, L14, L15, K16, C17, E19, Q20, V21, R22, K23, Q25, G26, D27, A29, 
A30, E33, K34, C36, A37, T38, Y39, K40, L41, H43, P44, E45, E46, V48, L49, L50, H52, 
S53, L54, 156, P57, P60, L61, S62, S63, P65, S66, Q67, A68, L69, Q70, L71, A72, G73, 
C74, S76, Q77, L78, S80, F83, Q86, G87, Q90, E93, G94, S96, P97, E98, L99, G100, P101, 

30 T102, D104, T105, Q107, L108, D109, Alll, D112, F113, T115, T116, W118, Q119, 
Q120, M121, E122, E123, L124, M126, A127, P128, A129, L130, Q131, P132, T133, 
Q134, G135, A136, M137, P138, A139, A141, S142, A143, F144, Q145, R146, R147, 
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S155, H156, Q158, S159, L161, E162, V163, S164, Y165, R166, V167, L168, R169, H170, 
L171, A172, Q173,P174. 

Similarly, the following residues have more than 50% ASA: MO, Tl, P2, L3, G4, 
P5, A6, S7, S8, L9, P10, Qll, S12, F13, L14, L15, K16, C17, E19, Q20, R22, K23, G26, 
5 D27, A30, E33, K34, T38, K40, L41, H43, P44, E45, E46, L49, L50, S53, P57, P60, L61, 
S62, S63, P65, S66, Q67, A68, L69, Q70, L71, A72, G73, S80, F83, Q90, G94, P97, E98, 
P101, D104, T105, L108, D112, F113, T115, T116, Q119, Q120, E122, E123, L124, M126, 
P128, A129, L130, Q131, P132, T133, Q134, G135, A136, A139, A141, S142, A143, F144, 
R147, S155, S159, E162, R166, V167, R169, H170, L171, A172, Q173, P174. 

10 The molecular graphics program Insightn v. 9 8.0 was used to determine residues 

having their CB atom (CA in the case of glycine) at a distance of more than 15 A from the 
nearest amine group, defined as the NZ atoms of lysine and the N atom of the N-terminal 
residue Tl. The following list includes the residues that fulfill this criteria in at least one of 
the 10 NMR structures. G4, P5, A6, S7, S8, L9, P10, Qll, L14, L15, L18, V21, R22, Q25, 

15 G26, D27, G28, A29, Q32, L35, C36, T38, Y39, C42, H43, P44, E45, E46, L47, V48, L49, 
L50, G51, H52, S53, L54, G55, 156, P57, W58, A59, P60, L61, S62, S63, C64, P65, S66, 
Q67, A68, L69, Q70, L71, A72, G73, C74, L75, S76, Q77, L78, H79, S80, G81, L82, F83, 
L84, Y85, Q86, G87, L88, L89, Q90, A91, L92, E93, G94, 195, S96, P97, E98, L99, G100, 
P101, T102, L103, D104, T105, L106, Q107, L108, D109, V110, Alll, D112, F113, A114, 

20 T115, T116, 1117, W118, Q119, Q120, M121, E122, E123, L124, G125, M126, A127, 

P128, A129, L130, Q131, P132, T133, Q134, G135, A136, M137, P138, A139, F140, A141, 
S142, A143, F144, Q145, R146, R147, A148, G149, G150, V151, L152, V153, A154, S155, 
H156, L157, Q158, S159, F160, L161, E162, V163, S164, Y165, R166, V167, L168, R169, 
H170, L171, A172, Q173, P174. 

25 The Insightn v.98.0 program was similarly used to determine residues having their 

CB atom (CA atom in the case of glycine) at a distance of more than 10A from the nearest 
acidic group, defined as the CG atoms of aspartic acid, the CD atoms of glutamic acid and 
the C atom of the C-terminal residue PI 74. The following list includes the residues that ful- 
fill this criteria in at least one of the 10 NMR structures. M0, Tl, P2, L3, G4, P5, A6, S7, 

30 S8, L9, P10, Qll, S12, F13, L14, T38, Y39, K40, 141, C42, L50, G51, H52, S53, L54, G55, 
156, P57, W58, A59, P60, L61, S62, S63, C64, P65, S66, Q67, A68, L69, Q70, L71, A72, 
G73, C74, L75, S76, Q77, L78, H79, S80, G81, L82, F83, L84, Y85, Q86, G87, L88, 1117, 
M126, A127, P128, A129, L130, Q131, P132, T133, Q134, G135, A136, M137, P138, 
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A139, F140, A141, S142, A143, F144, Q145, R146, R147, A148, G149, G150, V151, L152, 
V153, A154, S155, H156, L157, V167, L168, R169, H170, L171. 

By combining and comparing the above lists (or similar lists prepared for other polypep- 
tides), it is possible to select individual amino acid residues for modification to result in a 
5 list containing a limited number of amino acid residues whose modification in a given poly- 
peptide is likely to result in desired properties. 

Methods for PEGvlation of single chain G-CSF dimer and variants thereof 

PEGylation of single chain G-CSF dimer and variants thereof in solution 

Single chain G-CSF dimer and variants thereof are PEGylated at a concentration of 

10 250 /xg/ml in 50 mM sodium phosphate, 100 mM NaCl, pH 8.5. The molar surplus of PEG 
is 100 times with respect to PEGylation sites on the protein. The reaction mixture is placed 
in a thermo mixer for 30 minutes at 37°C at 1200 rpm. After 30 minutes, quenching of the 
reaction is obtained by adding a molar excess of glycine. 

Cation exchange chromatography is applied to remove excess PEG, glycine and 

15 other by-products from the reaction mixture. The PEGylation reaction mixture is diluted 
with 20 mM sodium citrate pH 2.5 until the ionic strength is less than 7 mS/cm. pH is ad- 
justed to 2.5 using 5 N HC1. The mixture is applied to an SP-sepharose FF column equili- 
brated with 20 mM sodium citrate pH 2.5. Unbound material is washed off the column using 
4 column volumes of equilibration buffer. PEGylated protein is eluted in three column vol- 

20 umes by adding 20 mM sodium citrate, 750 mM sodium chloride. Pure PEGylated G-CSF 
is concentrated and buffer exchange is performed using VivaSpin concentration devices, 
molecular weight cut-off (mwco): 10 kDa. 

PEGylation in microtiter plates of a tagged polypeptide with single chain G-CSF dimer activity 
25 A polypeptide exhibiting single chain G-CSF dimer activity is expressed with a 

suitable tag, e.g. any of the tags exemplified in the general description above, and culture 
broth is transferred to one or more wells of a microtiter plate capable of immobilising the 
tagged polypeptide. When the tag is Met-Lys-His-Gln~His-Gln-His-Gln-His-Gln-His-Gln- 
His-Gln-Gln (SEQ ID NO: 13), a nickel-nitrilotriacetic acid (Ni-NTA) HisSorb microtiter 
30 plate commercially available from QIAGEN can be used. 

After immobilization of the tagged polypeptide to the microtiter plate, the wells are 
washed in a buffer suitable for binding and subsequent PEGylation followed by incubating 
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the wells with the activated PEG of choice. As an example, M-SPA-PEG 5000 from Shear- 
water Corp. is used. The molar ratio of activated PEG to polypeptide should be optimized, 
but will typically be greater than 10:1, e.g. up to about 100:1 or higher. After a suitable reac- 
tion time at ambient temperature, typically around 1 hour, the reaction is stopped by removal 
5 of the activated PEG solution. The conjugated protein is eluted from the plate by incubation 
with a suitable buffer. Suitable elution buffers may contain imidazole, excess NTA or an- 
other chelating compound. The conjugated protein is assayed for biological activity and im- 
munogenicity as appropriate. The tag may optionally be cleaved off using a method known 
in the art, e.g. using diaminopeptidase the Gin in pos -1 can be converted to pyroglutamyl 
10 with GCT (glutamylcyclotransf erase) and finally cleaved off with PGAP (pyro-glutamyl- 
aminopeptidase), giving the untagged protein. The process involves several steps of metal 
chelate affinity chromatography. Alternatively, the tagged polypeptide may be conjugated. 

Methods used to characterize conjugated and non-conjugated single chain G-CSF dimer and 
variants thereof 

15 Determination of the molecular size of single chain G-CSF dimer and variants thereof 

The molecular weight of conjugated or non-conjugated single chain G-CSF dimer 
or variants thereof is determined by either SDS-PAGE, gel filtration, matrix assisted laser 
desorption mass spectrometry or equilibrium centrifugation. As explained above, SDS- 
PAGE provides information on the "apparent molecular weight". The actual molecular 

20 weight can advantageously be determined using mass spectrometry. SDS-PAGE is carried 
out using the NuPAGE® kit (Novex high-performance pre-cast gels) from Invitrogen™. 
15 jllI of the samples are loaded onto NuPAGE® 4-12% Bis-Tris gels (Cat. No. NP0321) 
and eluted in NuPAGE® MES SDS running buffer (Cat. No. NPO002-02) for 35 minutes at 
200 V and 120 mA. 

25 

Determination of polypeptide concentration 

The concentration of a polypeptide can be measured using optical density meas- 
urements at 280 nm, an enzyme-linked immunoadsorption assay (ELIS A), a radio- 
immunoassay (RIA), or other such immunodetection techniques well known in the art. Fur- 
30 thermore, the polypeptide concentration in a sample can be measured with the Biacore® 
instrument using a Biacore® chip coated with an antibody specific for the polypeptide. 
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Such an antibody can be covalently coupled to the Biacore® chip by various chem- 
istries. Alternatively, the antibody can be non-covalently bound e.g. by means of an anti- 
body specific for the Fc portion of the anti-polypeptide antibody. The Fc specific antibody is 
first coupled covalently to the chip. The anti-polypeptide antibody is then flowed over the 
5 chip and is bound by the first antibody in a directed fashion. Furthermore, biotinylated anti- 
bodies can be immobilised using a streptavidin coated surface (e.g. Biacore Sensor Chip 
SA®) (Real-Time Analysis of Biomolecular Interactions, Nagata and Handa (Eds.), 2000, 
Springer Verlag, Tokyo; Biacore 2000 Instrument Handbook, 1999, Biacore AB). 

When the sample is flowed over the chip the polypeptide will bind to the coated an- 
10 tibody and the increase in mass can be measured. By using a preparation of the polypeptide 
in a known concentration, a standard curve can be established and subsequently the concen- 
tration of the polypeptide in the sample can be determined. After each injection of sample 
the sensor chip is regenerated by a suitable eluent (e.g. a low pH buffer) that removes the 
bound analyte. 

15 Generally, the applied antibodies will be monoclonal antibodies raised against the 

wild-type polypeptide. Introduction of mutations or other manipulations of the wild-type 
polypeptide (extra glycosylations or polymer conjugations) may alter the recognition by 
such antibodies. Furthermore, such manipulations that give rise to an increased molecular 
weight of the polypeptide will result in an increased plasmon resonance signal. Conse- 

20 quently, it is necessary to establish a standard curve for every molecule to be tested. 

Methods used to determine the in vitro and in vivo activity of conjugated and non- 
conjugated single chain G-CSF dimer and variants thereof 

Primary assay 1 — in vitro single chain G-CSF dimer activity assay 

Proliferation of the murine cell line NFS-60 (obtained from Dr. J. Ihle, St. Jude 

25 Children's Research Hospital, Tennessee, USA) is dependent on the presence of active sin- 
gle chain G-CSF dimer in the growth medium. Thus, the in vitro biological activity of single 
chain G-CSF dimer and variants thereof can be determined by measuring the number of di- 
viding NFS-60 cells after addition of a single chain G-CSF dimer sample to the growth me- 
dium followed by incubation over a fixed period of time. 

30 NFS-60 cells are maintained in Iscoves DME Medium containing 10% w/w FBS 

(fetal bovine serum), 1% w/w Pen/Strep, 10 \ig per litre hG-CSF and 2 mM Glutamax. Prior 
to sample addition, cells are washed twice in growth medium without hG-CSF and diluted to 
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a concentration of 2.2 x 10 5 cells per ml. 100 |Lil of the cell suspension is added to each well 
of a 96 well microtiter plate (Corning). 

Samples containing conjugated or non-conjugated single chain G-CSF dimer or 
variants thereof are diluted to concentrations between l.lxlO" 6 M and l.lxlO" 13 M in the 
5 growth medium. 10 \il of each sample is added to 3 wells containing NFS-60 cells. A control 
consisting of 10 |Ltl of mammalian growth medium is added to 8 wells on each microtiter 
plate. The cells are incubated for 48 hours (37°C, 5% C0 2 ) and the number of dividing cells 
in each well is quantified using the WST-1 cell proliferation agent (Roche Diagnostics 
GmbH, Mannheim, Germany). 0.01 ml WST-1 is added to the wells followed by incubation 
10 for 150 min. at 37°C in a 5% CO2 air atmosphere. The cleavage of the tetrazolium salt 

WST-1 by mitochondrial dehydrogenases in viable cells results in the formation of formazan 
that is quantified by measuring the absorbance at 450 nm. The number of viable cells in each 
well is hereby quantified. 

Based on these measurements, dose-response curves for each conjugated and non- 
15 conjugated single chain G-CSF dimer molecule or variants thereof are calculated, after 
which the EC50 value for each molecule can be determined. This value is equal to the 
amount of active single chain G-CSF dimer protein that is necessary to obtain 50% of the 
maximum proliferation activity of non-conjugated human G-CSF. Thus, the EC50 value is a 
direct measurement of the in vitro activity of the given protein, a lower EC50 value indicat- 
20 ing a higher specific activity. 

Primary assay 2 - in vitro single chain G-CSF dimer activity assay 

The murine hematopoietic cell line BaF3 is transfected with a plasmid carrying the 
human G-CSF receptor and the promoter of the transcription regulator, fos, in front of the 

25 luciferase reporter gene. Upon stimulation of such a cell line with a single chain G-CSF 
dimer sample, a number of intracellular reactions lead to stimulation of fos expression, and 
consequently to expression of luciferase. This stimulation is monitored by the Steady-Glo™ 
Luciferase Assay System (Promega, Cat. No. E2510) whereby the in vitro activity of the G- 
CSF sample may be quantified. 

30 BaF3/hGCSF-R/pfos-lux cells are maintained at 37°C in a humidified 5% C0 2 at- 

mosphere in complete culture media (RPMI-1640/HEPES (Gibco/BRL, Cat. No. 22400), 
10% FBS (HyClone, characterized), lx Penicillin/Streptomycin (Gibco/BRL, Cat. No. 
15140-122), lx L-Glutamine (Gibco/BRL, Cat. No. 25030-081), 10% WEHI-3 conditioned 
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media (source of muIL-3), and grown to a density of 5 x 10 5 cells/mL (confluent). The cells 
are reseeded at about 2 x 10 4 cells/mL every 2-3 days. 

One day prior to the assay, log-phase cells are resuspended at 2 x 10 5 cells/mL in 
starving media (DMEM/F-12 (Gibco/BRL, Cat. No. 11039), 1% BSA (Sigma, Cat. No. 
5 A3675), lx Penicillin/Streptomycin (Gibco/BRL, Cat. No. 15140-122), Ix L-Glutamine 
(Gibco/BRL, Cat. No. 25030-081), 0.1% WEHI-3 conditioned media) and starved for 20 
hours. The cells are washed twice with Dulbecco's PBS (BioWhittaker Cat. No. 17512F), 
and tested for viability using Trypan Blue viability staining. The cells are resuspended in 
assay media (RPMI-1640 (phenol-red free, Gibco/BRL, Cat. No. 11835), 25 mM HEPES, 

10 1% BSA (Sigma, Cat. No. A3675), lx Penicillin/Streptomycin (Gibco/BRL, Cat. No. 

15140-122), lx L-Glutamine (Gibco/BRL, Cat. No. 25030-081) at 4 x 10 6 cells/mL, and 50 
|LiL are aliquotted into each well of a 96-well microtiter plate (Corning). Samples containing 
conjugated or non-conjugated single chain G-CSF dimer or variants thereof are diluted to 
concentrations between l.lxlO" 7 M and l.lxlO 12 M in the assay medium. 50 jllI of each 

15 sample is added to 3 wells containing BaF3/hGCSF-R/pfos-lux cells. A negative control 
consisting of 50 |Lil of medium is added to 8 wells on each microtiter plate. The plates are 
mixed gently and incubated for 2 hours at 37°C. The luciferase activity is measured by fol- 
lowing the Promega Steady-Glo™ protocol (Promega Steady-Glo™ Luciferase Assay Sys- 
tem, Cat. No. E2510). 100 (LiL of substrate is added per well followed by gentle mixing. Lu- 

20 minescence is measured on a TopCount luminometer (Packard) in SPC (single photon 
counting) mode. 

Based on these measurements, dose-response curves for each conjugated and non- 
conjugated single chain G-CSF dimer molecule or variants thereof are calculated, after 
which the EC50 value for each molecule can be determined. 

25 

Secondary assay - binding affinity of single-chain G-CSF dimer or variants thereof to the hG- 
CSF receptor 

Binding of single-chain G-CSF dimer or variants thereof to the hG-CSF receptor is 
studied using standard binding assays. The receptors may be purified extracellular receptor 
30 domains, receptors bound to purified cellular plasma membranes, or whole cells - the cellu- 
lar sources being either cell lines that inherently express G-CSF receptors (e.g. NFS-60) or 
cells transfected with cDNAs encoding the receptors. The ability of single-chain G-CSF 
dimer or variants thereof to compete for the binding sites with native G-CSF is analyzed by 
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incubating with a labeled G-CSF-analog, for instance biotinylated hG-CSF or radioiodinated 
hG-CSF. An example of such an assay is described by Yamasaki et al. (Drugs. Exptl. Clin. 
Res. 24:191-196 (1998)). 

The extracellular domains of the hG-CSF receptor can optionally be coupled to Fc 
5 and immobilized in 96 well plates. Single-chain G-CSF dimer or variants thereof are subse- 
quently added and the binding of these is detected using either specific anti-hG-CSF anti- 
bodies or biotinylated or radioiodinated hG-CSF. 

Measurement of the in vivo half -life of conjugated and non-conjugated single-chain G-CSF 

10 dimer and variants thereof 

An important aspect of the invention is the prolonged biological half-life that is ob- 
tained by construction of a single-chain G-CSF dimer conjugated to a polymer moiety. The 
rapid decrease of hG-CSF serum concentrations has made it important to evaluate biological 
responses to treatment with conjugated and non-conjugated single-chain G-CSF dimers and 

15 variants thereof. Measurement of in vivo biological half-life can be carried out as described 
below. 

Male Sprague Dawley rats (7 weeks old) are used. On the day of administration, the 
weights of the animals are measured (280-310 gram per animal). 100 jJtg per kg body weight 
of the non-conjugated and conjugated single-chain G-CSF dimer samples are each injected 

20 intravenously into the tail vein of five rats. At 1 minute, 30 minutes, 1, 2, 4, 6, 24 and 48 
hours after the injection, 300 (Ltl of blood is withdrawn from a tail vein of each rat. The blood 
samples are stored at room temperature for V/z hours followed by isolation of serum by cen- 
trifugation (4°C, 5000xg for 20 minutes). The serum samples are stored at -80°C until the 
day of analysis. The amount of active single-chain G-CSF dimer in the serum samples is 

25 quantified by a single-chain G-CSF dimer in vitro activity assay (see primary assay 2) after 
thawing the samples on ice. 

Another example of an assay for the measurement of in vivo half-life of single- 
chain G-CSF dimer or variants thereof is described in US 5,824,778, the content of which is 
hereby incorporated by reference. 



30 
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Measurement of the in vivo biological activity in healthy rats of conjugated and non- 
conjugated single-chain G-CSF dimer and variants thereof 

Measurement of the in vivo biological effects of single-chain G-CSF dimer in SPF 
Sprague Dawley rats (purchased from M & B A/S, Denmark) was used to evaluate the bio- 
5 logical efficacy of conjugated and non-conjugated single-chain G-CSF dimer and variants 
thereof. 

On the day of arrival the rats were randomly allocated into groups of 6. The animals 
were acclimatised for a period of 7 days wherein individuals in poor condition or at extreme 
weights were rejected. The weight range of the rats at the start of the acclimatization period 
10 was 250-270g. 

On the day of administration the rats were fasted for 16 hours followed by subcutane- 
ous injection of 100 jitg per kg body weight of Neupogen® or conjugated or non-conjugated 
single-chain G-CSF dimer or a variant thereof. Each single-chain G-CSF dimer sample was 
injected into a group of 6 randomized rats. Blood samples of 300 ]Lil EDTA stabilised blood 

15 were drawn from a tail vein of the rats prior to dosing and at 6, 12, 24, 36, 48, 72, 96, 120, 144 
and 168 hours after dosing. The blood samples were analyzed for total white blood cell counts. 
On the basis of these measurements the biological efficacy of conjugated and non-conjugated 
single-chain G-CSF dimer and variants thereof was evaluated. 

Further examples of assays that can be used to measure the in vivo biological activ- 

20 ity of single-chain G-CSF dimer or variants thereof are described in US 5,681,720, US 
5,795,968, US 5,824,778, US 5,985,265 and by Bowen et al., Experimental Hematology 
27:425-432 (1999). 

Measurement of the in vivo biological activity in rats with chemotherapy-induced neutropenia 
25 of conjugated and non-conjugated single-chain G-CSF dimer and variants thereof 

SPF Sprague Dawley rats were purchased from M & B A/S, Denmark. On the day 
of arrival the rats were randomly allocated into groups of 6. The animals were acclimatised 
for a period of 7 days wherein individuals in poor condition or at extreme weights were re- 
jected. The weight range of the rats at the start of the acclimatization period was 250-270 g. 
30 24 hours before administration of the single-chain G-CSF dimer samples the rats 

were injected i.p. with 50-100 mg per kg body weight of cyclophosphamide (CPA). At day 
0, 100 jag per kg body weight of single-chain G-CSF dimer or a variant thereof was injected 
s.c Each single-chain G-CSF dimer sample was injected into a group of 6 randomized rats. 
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In addition, 5 |Lig per kg body weight of Neupogen® was injected s.c. into a group of 6 ran- 
domized rats at time 0, 24, 48, 72 and 96 hours. Blood samples of 300 [il EDTA stabilized 
blood were drawn from a tail vein of the rats prior to dosing and at 6, 12, 24, 36, 48, 72, 96, 
120, 144 and 168 hours after dosing. The blood samples were analyzed for total white blood 
5 cell counts. On the basis of these measurements the biological efficacy of conjugated and 
non-conjugated single-chain G-CSF dimer and variants thereof was evaluated. 

Determination of polypeptide-receptor binding affinity (on- and off-rate) 

The strength of the binding between a receptor and ligand can be measured using an 

10 enzyme-linked immunoadsorption assay (ELISA), a radioimmunoassay (RIA), or other 
such immunodetection techniques well known in the art. The ligand-receptor binding inter- 
action may also be measured with the Biacore® instrument (Zhou et al., Biochemistry, 1993, 
32, 8193-98; Faegerstram and O'Shannessy, 1993, In Handbook of Affinity Chromatogra- 
phy, 229-52, Marcel Dekker, Inc., NY). 

15 The Biacore® technology allows one to bind receptor to a gold surface and to flow 

ligand over it. Plasmon resonance detection gives direct quantification of the amount of 
mass bound to the surface in real time. This technique yields both on and off-rate constants 
and thus a ligand-receptor dissociation constant and an affinity constant can be directly de- 
termined. 

20 

In vitro immunogenicity test of single-chain G-CSF dimer conjugates 

The reduced immunogenicity of a conjugate of the invention can be determined by 
use of an ELISA method measuring the immunoreactivity of the conjugate relative to a ref- 
erence molecule or preparation. The reference molecule or preparation is normally a recom- 

25 binant human G-CSF preparation such as Neupogen® or another recombinant human G- 
CSF preparation, e.g. an N-terminally PEGylated rhG-CSF molecule as described in US 
5,824,784. The ELISA method is based on antibodies from patients treated with one of these 
recombinant G-CSF preparations. The immunogenicity is considered to be reduced when the 
conjugate of the invention has a statistically significant lower response in the assay than the 

30 reference molecule or preparation. 
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Neutralisation of activity in the single-chain G-CSF dimer activity assay 

The neutralisation of single-chain G-CSF dimer conjugates by anti-G-CSF sera is 
analysed using the primary single-chain G-CSF dimer assay 2 described above. 

Sera from patients treated with the G-CSF reference molecule or from immunised 
5 animals are used. Sera are added either in a fixed concentration (dilution 1:20-1:500 (pt sera) 
or 20-1000 ng/ml (animal sera)) or in five-fold serial dilutions of sera starting at 1:20 (pt 
sera) or 1000 ng/ml (animal sera). Single-chain G-CSF dimer conjugate is added either in 
seven fold-dilutions starting at 10 nM or in a fixed concentration (1-100 pM) in a total vol- 
ume of 80/xl DMEM (Dulbecco's modified Eagle's medium) + 10% FCS (fetal calf serum). 
10 The sera are incubated for 1 hr. at 37°C with single-chain G-CSF dimer conjugate. 

The samples (0.01 ml) are then transferred to 96 well tissue culture plates contain- 
ing NFS-60 cells in 0.1 ml DMEM media. The cultures are incubated for 48 hours at 37°C in 
a 5% C0 2 air atmosphere. 0.01 ml WST-1 (WST-1 cell proliferation agent, Roche Diagnos- 
tics GmbH, Mannheim, Germany) is added to the cultures and incubated for 150 min. at 
15 37°C in a 5% C0 2 air atmosphere. The cleavage of the tetrazolium salt WST-1 by mito- 
chondrial dehydrogenases in viable cells results in the formation of formazan that is quanti- 
fied by measuring the absorbance at 450 nm. 

When single-chain G-CSF dimer conjugate samples are titrated in the presence of a 
fixed amount of serum, the neutralising effect is defined as fold inhibition (FT) quantified as 
20 EC50(with serum)/EC50 (without serum). The reduction of antibody neutralisation of G- 
CSF variant proteins is defined as 

(FT variant -1) 

(1 - ) x 100% 

25 (FI wt - 1) 

EXAMPLE 1 

Construction and cloning of a synthetic gene encoding single-chain G-CSF dimer 

30 A DNA fragment, encoding the YAP3 signal sequence (WO 98/32867, SEQ ID 

NO:2), the TA57 leader sequence (WO 98/32867, SEQ ID NO:3), a KEX2 protease recogni- 
tion site (AAAAGA), G-CSF copy 1 (SEQ ID NO:4) and G-CSF copy 2 (SEQ ID NO:5) in 
the single-chain G-CSF dimer was synthesised following the general procedure described by 
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Stemmer et al. (1995), Gene 164, pp. 49-53. A Bam HI and an Xba I digestion site were in- 
troduced upstream and downstream, respectively, of the gene. The DNA fragment was 
cloned into the Bam HI and Xba I digestion sites in plasmid pJS037 (Okkels, Ann. New 
York Acad. Sci. 1 '82:202-207 1996) using standard DNA techniques, resulting in plasmid 
5 pscG-CSF. 

Another DNA fragment, consisting of a Bam HI digestion site, the Kozak consen- 
sus sequence (Kozak, M., J Mol Biol, Aug. 1987; 196(4):947-50), a sequence encoding the 
hG-CSF signal peptide (SEQ ID NO:7), G-CSF copy 1 with the codon usage optimised for 
expression in CHO cells (SEQ ID NO:8) and G-CSF copy 2 (SEQ ID NO:5) in the single- 
10 chain G-CSF dimer and an Xba I digestion site, was also synthesised following the general 
procedure described by Stemmer et al. (1995), Gene 164, pp. 49-53. The DNA fragment was 
inserted into the Bam HI and Xba I digestion sites in plasmid pcDNA3.1(+) (Invitrogen) 
using standard DNA techniques. This resulted in plasmid pscG-CSFCHO. 

EXAMPLE 2 

15 Expression of single-chain G-CSF dimer in Saccharomyces cerevisiae 

Expression of the single-chain G-CSF dimer in S. cerevisiae YNG318 (available 
from the American Type Culture Collection, VA, USA as ATCC 208973) was performed 
using the following procedure: 10 |ol 0.2 (ig/|Lil pscG-CSF, 10 |xl salmon testes carrier DNA 

20 and 100 |Lil competent S. cerevisiae YNG318 cells were mixed and 600 \x\ 25% PEG 4000 
containing 0.1 M lithium acetate was added. The cells were incubated at 37°C for 30 min- 
utes and placed in a 42°C water bath for 15 minutes. The cells were pelleted by centrifuga- 
tion (4000 rpm, 2 minutes), the supernatant was discharged and the cells were resuspended 
in non-selective YPD medium (1% w/w yeast extract (Difco), 2% w/w peptone bacto 

25 (Difco), 3% w/w dextrose (Roquette)). Following incubation at 37°C for 1 hour, the cells 
were plated on selective SC-without-uracil medium (7.5 g per litre yeast nitrogen base w/o 
amino acids (Difco), 11.3 g per litre Bernstein acid (Merck), 6.8 g per litre NaOH (Merck), 
5.6 g per litre casamino acid w/o vitamin, 0.1 g per litre tryptophan, 20 g per litre glucose 
(Sigma), 0.1 g per litre threonine). 

30 One transformant of S. cerevisiae containing plasmid pscG-CSF with the ability to 

grow on selective medium without uracil was innoculated in 1000 ml liquid SC-without- 
uracil medium containing 0.2% Tween 80 at 37°C for 72 hours. The yeast culture was cen- 
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trifuged (4000 rpm, 5 minutes) and the supernatant was isolated. Expression of single-chain 
G-CSF dimer was verified by Western Blot analysis using the ImmunoPure® Ultra- 
Sensitive ABC Rabbit IgG Staining kit (Pierce) and a polyclonal antibody against hG-CSF 
(Pepro Tech EC Ltd.). It was observed that the protein had the correct size. 

5 EXAMPLE 3 

Expression of single-chain G-CSF dimer in Chinese hamster ovary ( CHO) cells 

The day before transfection the CHO Kl cell line (ATCC #CC1-61) was seeded in 
a T-25 flask in 5 ml DMEM/F-12 medium (Gibco # 31330-038) supplemented with 10% 

10 FBS and penicillin/streptomycin. The following day (at nearly 100% confluency) the trans- 
fection was prepared: 90 |Lil DMEM medium without supplements was aliquoted into a 14 
ml polypropylene tube (Corning). 10 |il Fugene 6 (Roche) was added directly into the me- 
dium and incubated for 5 min at room temperature. In the meantime 5 jLtg plasmid pscG- 
CSFCHO was aliquoted into another 14 ml polypropylene tube. After incubation the Fugene 

15 6 mix was added directly to the DNA solution and incubated for 15 min at room tempera- 
ture. After incubation the whole volume was added drop-wise to the cell medium. 

The next day the medium was exchanged with fresh medium containing 360 |ig/ml 
hygromycin (Gibco). Every day hereafter the selection medium was renewed until the pri- 
mary transfection pool had reached 100% confluency. The primary transfection pool was 

20 subsequently sub-cloned by limited dilution (300 cells seeded in five 96-well plates) 

whereby stable cell lines expressing single chain G-CSF dimer was obtained. One stable cell 
line expressing 8 mg/L single chain G-CSF dimer as determined by ELIS A (Quantikine 
Human G-CSF Immunoassay, R&D Systems Cat. No. DCS50) was transferred to T-175 
flasks. 

25 Cells from 1-2 confluent T-175 flasks were transferred to one roller bottle (1700 

cm 2 ) in 300 ml DMEM/F-12 medium (Life Technologies # 31330) supplemented with 10% 
FBS and penicillin/streptomycin (P/S). The medium was exchanged every second day until 
the bottle was nearly confluent. The medium was then changed to 300 ml serum-free Ultra- 
CHO medium (BioWhittaker # 12-724) supplemented with 1/1000 EX-CYTE (Serologicals 

30 Proteins # 81129N) and P/S. After 4 days (where the medium was exchanged every second 
day) the roller bottle was ready for production and the medium was shifted to the production 
medium: DMEM/F-12 medium without phenol red (Life Technologies #21041) supple- 
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mented with 1/100 ITS A (Life Technologies # 51300-044) [ITS A: Insulin (1.0 g/L) - Trans- 
ferrin (0.55 g/L) - Selenium (0.67 mg/L) supplement for Adherent cultures], 1/1000 EC- 
CYTE and P/S. During the production period the medium was exchanged every day. 

EXAMPLE 4 

5 Purification of single-chain G-CSF dimerfrom yeast culture supernatants 

1000 ml of supernatant from a cultivation of a S. cerevisiae strain expressing sin- 
gle-chain G-CSF dimer was sterile filtered. The ionic strength in the culture supernatant was 
measured. To lower the ionic strength before application onto a cation-exchange column, the 

10 supernatant was diluted with lOmM Na acetate pH 4.5 to a volume of 5000 ml. Finally, the 
pH was adjusted to pH 4.5. 

The diluted sample was applied to an SP Sepharose FF column (20 ml) equilibrated 
in 50 rnM Na acetate, pH 4.5 operated at a flow rate of 5 ml/min. Following application, the 
column was washed with equilibration buffer until the A 2 so in the column effluent reached a 

15 stable level. Elution was performed at a flow rate of 3 ml/min using a linear gradient of 
NaCl (0 M to 0.75 M) in 50 mM Na acetate, pH 4.5 over 20 column volumes. Fractions of 
2.4 ml were collected. The fractions containing single-chain G-CSF dimer were pooled. 
Sample preparation before performing hydrophobic interaction chromatography was made 
by adding 4 volumes of 1 M ammonium sulphate, pH 8 to reach a final concentration of 800 

20 mM ammonium sulphate. The pH of the sample was measured before loading and was 
found to be 8. 

The sample was loaded onto a column packed with 5 ml ToyoPearl Phenylsepha- 
rose-650 equilibrated with 10 column volumes of 1 M ammonium sulphate, pH 8 at a flow 
rate of 2 ml/min. The column was washed with 0.8 M ammonium sulphate until the A 2 so in 
25 the column effluent reached a stable level. Elution of the column was performed using a 
linear gradient (0.8 M ammonium sulphate, pH 8, to MilliQ filtered water) for 60 column 
volumes at a flow rate of 3 ml/min. Fractions of 2.5 ml were collected. 

The concentration of single-chain G-CSF dimer in the fractions from the HIC col- 
umn was determined by ELIS A (Quantikine G-CSF ELIS A assay, R&D Systems catalogue 
30 number DCS50). 

The amino acid sequence of the single-chain G-CSF dimer is given in SEQ ID 

NO:6. 
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EXAMPLE 5 

Purification of single-chain G-CSF dimerfrom CHO cell culture supernatants 

Culture medium from the roller bottle cultivations of CHO cells expressing single 
5 chain G-CSF dimer was harvested and sterile filtered using a 0.22/xm filter. The conductiv- 
ity was adjusted to 75 mS/cm using 4 M sodium chloride and the conditioned medium was 
applied on a column of Phenyl Toyo Pearl 650 S resin equilibrated with 20 mM sodium 
phosphate, pH 7.0; 750 mM sodium chloride. The column was washed to stable base line 
using the equilibration buffer and eluted stepwise using 20 mM sodium phosphate, pH 7.0. 
10 Fractions containing sc-GCSF were pooled and adjusted with 50 mM sodium acetate to 
reach pH 4.5 and a conductivity below 7mS/cm. The adjusted fractions were applied on a 
SP-sepharose FF (Pharmacia) at a flow rate of 150-300 cm/hour. The column was washed 
and eluted by a step gradient 0-100% B -buffer (750 mM sodium chloride, 50 mM sodium 
acetate (pH 4.5) with increments of 10% B-buffer at a flow rate of 150 cm/hour. The frac- 
15 tions containing single chain G-CSF dimer were subsequently pooled. 

The amino acid sequence of the single-chain G-CSF dimer is given in SEQ ID 

NO:6. 

EXAMPLE 6 

Characterization of purified single-chain G-CSF dimer from CHO cells 
20 Purity 

The purity of purified single-chain G-CSF dimer was determined by reverse phase 
HPLC. A single-chain G-CSF dimer sample was applied to a Vydac Cis reverse phase col- 
umn (0.21 x 5 cm) and isocratically eluted with buffer A (0.1% TFA in water) and buffer B 
(0.1% TFA in acetonitril) using the following gradient: 0-5 minutes: 100% buffer A, 5 - 
25 15 minutes: 0-40% buffer B, 15-35 minutes: 40-100% buffer B, 35-40 minutes; 100-0% 
buffer B. The purity was determined to be higher than 90%. The RP-HPLC result was con- 
firmed by SDS-PAGE analysis. 

Identity 

The identity of purified single-chain G-CSF dimer was confirmed by N-terminal 
30 amino acid sequencing. 
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Size 

The purified single-chain G-CSF dimer was analyzed by SDS-PAGE. A single 
band having an apparent molecular weight of 38 kDa was detected. MALDI-TOF mass 
spectrometry of purified single-chain G-CSF dimer showed that the size of the protein was 
5 38.8 kDa corresponding to 2 times the molecular weight of hG-CSF. 

Concentration 

An estimate of the single chain G-CSF dimer concentration in purified samples was 
obtained by spectrophotometric methods. By measuring the absorbance at 280 nm and using 
a theoretical extinction coefficient of 0.83, the protein concentration was calculated. A more 
10 accurate protein determination was obtained by amino acid analysis. Amino acid analysis 
performed on purified single chain G-CSF dimer revealed that the experimentally deter- 
mined amino acid composition was in agreement with the expected amino acid composition 
based on the DNA sequence. 

EXAMPLE 7 

15 Construction of single-chain G-CSF dimer variants 

Specific substitutions of existing amino acids in the single chain G-CSF dimer to 
other amino acid residues, e.g. the specific substitutions discussed above in the general de- 
scription, were introduced using standard DNA techniques known in the art. The new single 
20 chain G-CSF dimer variants were made using plasmid pscG-CSFCHO as DNA template in 
the PGR reactions. The following variants were constructed: Single chain G-CSF dimer 
(K16R) cop y 1,2 (i.e. with the substitution K16R in copy 1 and copy 2), single chain G-CSF 
dimer (K16R K34R) copy 1,2, and single chain G-CSF dimer (K16R K34R K40R) C o P y 1,2- The 
variants were expressed in CHO cells and purified as described in Example 5. 

25 EXAMPLE 8 

Covalent attachment of SPA-PEG to hG-CSF or variants thereof 

SPA-PEG 5000, SPA-PEG 12000 and SPA-PEG 20000 (Shearwater Corp.) were 
covalently linked to single chain G-CSF dimer and variants thereof purified from CHO cells 
30 as described above ("PEGylation of single chain G-CSF dimer and variants thereof in solu- 
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tion"). The apparent sizes of the PEGylated compounds as determined by SDS-PAGE are 
listed below. 



Molecule 


Apparent size on SDS-PAGE (kDa) 


Single-chain G-CSF dimer 


38 


Single-chain G-CSF dimer PEG5000 


80-120 


Single-chain G-CSF dimer PEG12000 


120-180 


Single-chain G-CSF dimer PEG20000 


120-200 



Molecule 


Apparent size on 
SDS-PAGE (kDa) 


Single-chain G-CSF dimer (K16R) copy i t2 


38 


Single-chain G-CSF dimer (K16R) cop y 1,2 PEG5000 


60-110 


Single-chain G-CSF dimer (K16R) cop y 1,2 PEG12000 


120-160 


Single-chain G-CSF dimer (K16R K34R) copy 1,2 


38 


Single-chain G-CSF dimer (K16R K34R) copy 1,2 PEG5000 


55-110 


Single-chain G-CSF dimer (K16R K34R) copy h2 PEG12000 


120-160 


Single-chain G-CSF dimer (K16R K34R K40R) copy i, 2 


38 


Single-chain G-CSF dimer (K16R K34R K40R) copy 1,2 PEG5000 


55-110 


Single-chain G-CSF dimer (K16R K34R K40R) copy u PEG12000 


120-160 



EXAMPLE 9 

In vitro biological activity of non-conjugated and conjugated hG-CSF and variants thereof 

10 The in vitro biological activities of conjugated and non-conjugated single chain G- 

CSF dimer and variants thereof were measured as described above in "Primary assay 2 - in 
vitro single chain G-CSF dimer activity assay". The in vitro bioactivities, determined on the 
basis of the measured EC50 values for each variant with and without conjugation of SPA- 
PEG to the available PEGylation sites, are listed below. The values have been normalized 

15 with respect to the EC50 value of non-conjugated hG-CSF (Neupogen®), which in this as- 
say is 30 pM. This value was measured simultaneously with that of the variants under iden- 
tical assay conditions. The values in the table thus indicate % specific activity relative to the 
activity of non-conjugated hG-CSF. 
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Molecule 


Specific activity 
(% of Neupogen®) 


Single-chain G-CSF dimer 


214 


Single-chain G-CSF dimer PEG5000 


120 


Single-chain G-CSF dimer PEG12000 


62 


Single-chain G-CSF dimer PEG20000 


25 




Molecule 


Specific activity 
(% of Neupogen®) 


Single-chain G-CSF dimer (K16R) cop y 1,2 


135 


Single-chain G-CSF dimer (K16R K34R) CO py 1,2 


329 


Single-chain G-CSF dimer (K16R K34R) cop y 1,2 PEG5000 


19 


Single-chain G-CSF dimer (K16R K34R) CO py 1,2 PEG12000 


8 


Single-chain G-CSF dimer (K16R K34R K40R) cop y 1,2 


300 


Single-chain G-CSF dimer (K16R K34R K40R) cop y 1,2 PEG5000 


10 


Single-chain G-CSF dimer (K16R K34R K40R) cop y 1,2 PEG12000 


10 



The above results show that the non-PEGylated single-chain dimers all have a 
higher in vitro activity than that of hG-CSF. This increased activity is believed to be due to 
5 an increased binding affinity. It may also be seen that PEGylation substantially reduces the 
activity, and that this reduction in activity is greater when the MW of the PEG molecules is 
higher. 

EXAMPLE 10 

In vivo biological activity in healthy rats of non-conjugated and conjugated single chain hG- 
1 0 CSF dimer and variants thereof 

The in vivo biological activities of non-conjugated hG-CSF (Neupogen®), non- 
conjugated single-chain G-CSF dimer, SPA-PEG 5000 conjugated single-chain G-CSF 
dimer, SPA-PEG 12000 conjugated single-chain G-CSF dimer, and SPA-PEG 20000 conju- 
15 gated single-chain G-CSF dimer in healthy rats were measured as described above ("Meas- 
urement of the in vivo biological activity in healthy rats of conjugated and non-conjugated 
single-chain G-CSF dimer and variants thereof). The results are shown in Figure 1. 

The initial formation of white blood cells (first 12 hours) in rats receiving any of 
the four preparations of single-chain G-CSF dimer occurred with the same rate as observed 
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in rats receiving hG-CSF (Neupogen®). Thus, single-chain G-CSF dimer stimulates forma- 
tion of white blood cells in vivo as efficiently as hG-CSF. The level of white blood cells in 
rats receiving either hG-CSF (Neupogen®) or non-conjugated single-chain G-CSF dimer 
returned to normal 48 hours after injection. In rats receiving SPA-PEG 5000 conjugated 
5 single-chain G-CSF dimer, SPA-PEG 12000 conjugated single-chain G-CSF dimer, and 
SPA-PEG 20000 conjugated single-chain G-CSF dimer, the level of white blood cells re- 
turned to normal only after about 72 hours, 72 hours and 96 hours, respectively, after injec- 
tion. Thus, the duration of action of the PEGylated single-chain G-CSF dimers is signifi- 
cantly longer than that of hG-CSF (Neupogen®). 

10 EXAMPLE 11 

In vivo biological activity in rats with chemotherapy-induced neutropenia of non-conjugated 
and conjugated single-chain G-CSF dimer and variants thereof 

The in vivo biological activities of non-conjugated hG-CSF (Neupogen®), SPA- 

15 PEG5000 conjugated single-chain G-CSF dimer, and SPA-PEG20000 conjugated single- 
chain G-CSF dimer in rats with chemotherapy-induced neutropenia were measured as de- 
scribed above ("Measurement of the in vivo biological activity in rats with chemotherapy- 
induced neutropenia of conjugated and non-conjugated single-chain G-CSF dimer and vari- 
ants thereof) using 50 mg per kg body weight of cyclophosphamide (CPA). The results are 

20 shown in Figure 2. 

The initial stimulation of white blood cell formation during the first 12 hours was 
identical in rats receiving hG-CSF (Neupogen®), SPA-PEG5000 conjugated single-chain G- 
CSF dimer, or SPA-PEG20000 conjugated single-chain G-CSF dimer. After 36 hours the 
number of white blood cells (WBC) in the Neupogen®-treated rats dropped to the level that 

25 was observed in the untreated group (about 2xl0 9 cells per litre). At this point the rats were 
neutropenic. The level of white blood cells in the Neupogen-treated group reached normal 
levels (lOxlO 9 cells per litre) after 168 hours. 

The level of white blood cells in the two groups treated with SPA-PEG 5000 con- 
jugated single-chain G-CSF dimer and SPA-PEG 20000 conjugated single-chain G-CSF 

30 dimer dropped to a minimum of about 2xl0 9 cells per litre only after 48 hours. The white 
blood cell levels in the two groups were back to normal after 144 hours and 120 hours, re- 
spectively. Thus, the two PEG-conjugated single chain G-CSF dimer compounds were able 
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to both relieve the degree of neutropenia and to reduce the time until the white blood cell 
levels were back to normal (the duration of neutropenia) significantly from 132 hours in the 
Neupogen®-treated group to 96 hours and 72 hours, respectively, in the groups treated with 
either SPA-PEG 5000 conjugated single chain G-CSF dimer or SPA-PEG 20000 conjugated 
5 single chain G-CSF dimer. 
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CLAIMS 

1. A single-chain multimeric polypeptide conjugate comprising at least two units of a 
monomelic polypeptide linked via a peptide bond or a peptide linker, wherein the 

5 monomelic polypeptide is of a type that is biologically active in monomelic form, and 
having at least one polymer moiety covalently bound to an attachment group of said 
polypeptide. 

2. A single-chain multimeric polypeptide conjugate comprising at least two units of a 
10 monomelic polypeptide linked via a peptide bond or a peptide linker, wherein the 

monomelic polypeptide is of a type that is biologically active in monomeric form, and 
wherein at least one of said units differs from the corresponding wild-type polypeptide in 
that at least one amino acid residue comprising an attachment group for a non- 
polypeptide moiety has been introduced or removed, and having at least one non- 
15 polypeptide moiety covalently bound to an attachment group of said polypeptide. 

3. The single-chain polypeptide conjugate of claim 2, wherein at least one monomeric unit 
differs from the corresponding wild-type polypeptide in at least one amino acid residue 
modification selected from the group consisting of introduction of a lysine, cysteine, as- 

20 partic acid, glutamic acid or histidine residue; removal of a lysine, cysteine, aspartic 
acid, glutamic acid or histidine residue; introduction of an N- or O-glycosylation site; 
and removal of an N- or O-glycosylation site. 

4. The single-chain polypeptide conjugate of any of the preceding claims, wherein at least 
25 one monomeric unit differs from the corresponding wild-type polypeptide in that least 

one attachment site for a non-polypeptide moiety has been introduced in a position that 
in the wild-type polypeptide is occupied by a surface-exposed amino acid residue, and/or 
wherein at least one amino acid residue that is surface-exposed in the wild-type polypep- 
tide has been removed. 

30 

5. The single-chain polypeptide conjugate of any of the preceding claims, wherein each 
monomeric polypeptide unit has a molecular weight of less than about 34 kDa, such as 
less than about 30 kDa, exclusive of any polymer moiety covalently bound to said unit. 



WO 02/36626 



PCT/DK01/00724 



85 

6. The single-chain polypeptide conjugate of any of the preceding claims, comprising two 
or more monomelic polypeptide units with the same amino acid sequence. 

5 7. The single-chain polypeptide conjugate of claim 6, comprising two or more monomelic 
units with a wild-type amino acid sequence. 

8. The single-chain polypeptide conjugate of claim 6, comprising two or more units that are 
modified, in relation to the relevant wild-type amino acid sequence, by introduction . 

10 and/or removal of one or more amino acid residues. 

9. The single-chain polypeptide conjugate of any of claims 1-5, comprising two or more 
monomelic polypeptide units with different amino acid sequences. 

15 10. The single-chain polypeptide conjugate of claim 9, comprising at least one monomelic 
unit with a wild-type amino acid sequence and least one unit that is modified, in relation 
to said wild-type amino acid sequence, by introduction and/or removal of one or more 
amino acid residues. 

20 11. The single-chain polypeptide conjugate of claim 9, comprising at least two different 

monomelic units that are modified, in relation to the relevant wild-type amino acid se- 
quence, by introduction and/or removal of one or more amino acid residues. 

12. The single-chain polypeptide conjugate of any of the preceding claims, wherein the 
25 polypeptide is a cytokine, a growth factor or a hormone. 

13. The single-chain polypeptide conjugate of claim 12, wherein the monomelic units are 
selected from the group consisting of interleukins such as interleukin-1 alpha (IL-la), in- 
terleukin-lbeta (IL-1P), interleukin-lra (IL-lra), interleukin-2 (IL-2), interleukin-3 (IL- 

30 3), interleukin-4 (IL-4), interleukin-6 (IL-6), interleukin-7 (IL-7), interleukin-9 (IL-9), 

interleukin-1 1 (IL-11), interleukin-1 3 (IL-13), interleukin-1 5 (DL-15), interleukin 17 (DL- 
17) and interleukin 18 (IL-18); colony stimulating factors such as granulocyte colony 
stimulating factor (G-CSF) and granulocyte macrophage colony stimulating factor (GM- 
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CSF); growth factors such as stem cell growth factor; interferons such as interferon al- 
pha (INF-a) and interferon beta (INF- (3); and members of the tumour necrosis family 
such as tumour necrosis factor alpha (TNF-a), tumour necrosis factor beta (TNF-p) and 
osteoprotegerin ligand (OPGL). 

5 

14. The single-chain polypeptide conjugate of claim 13, wherein the conjugate has G-CSF 
activity. 

15. The single-chain polypeptide conjugate of any of the preceding claims, comprising at 

10 least one covalently bound non-polypeptide moiety selected from the group consisting of 
polymer molecules, lipophilic compounds, oligosaccharide moieties and organic 
derivatizing agents. 

16. The single-chain polypeptide conjugate of claim 15, comprising at least one covalently 
15 bound polymer molecule selected from the group consisting of linear and branched 

polyalkylene oxides. 

17. The single-chain polypeptide conjugate of claim 16, wherein the polymer molecule is 
polyethylene glycol. 

20 

18. The single-chain polypeptide conjugate according to any of the preceding claims which 
has a functional in vivo half-life and/or serum half -life that compared to that of a corre- 
sponding non-conjugated monomelic polypeptide is increased by at least about 25%, 
preferably at least about 50%, e.g. at least about 100%. 

25 

19. A single-chain multimeric G-CSF polypeptide comprising at least two G-CSF polypep- 
tide monomers linked via a peptide bond or a peptide linker, wherein at least one of said 
monomers is a variant of wild-type human G-CSF comprising at least one amino acid 
residue modification selected from the group consisting of introduction of a lysine, cys- 

30 teine, aspartic acid, glutamic acid or histidine residue, and removal of a lysine, cysteine, 
aspartic acid, glutamic acid or histidine residue. 
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20. A single-chain multimeric G-CSF polypeptide comprising at least two G-CSF polypep- 
tide monomers linked via a peptide bond or a peptide linker, wherein at least one of said 
monomers is a variant of wild-type human G-CSF comprising at least one amino acid 
residue modification selected from the group consisting of introduction of an O- 

5 glycosylation site, and removal of an N- or O-glycosylation site. 

21. A single-chain multimeric G-CSF polypeptide comprising at least two G-CSF polypep- 
tide monomers linked via a peptide bond or a peptide linker, wherein at least one of said 
monomers is a variant of wild-type human G-CSF comprising at least one introduced at- 

10 tachment site for a polymer moiety. 

22. A single-chain multimeric G-CSF polypeptide comprising at least two G-CSF polypep- 
tide monomers linked via a peptide bond or a peptide linker, wherein at least one of said 
monomers is a variant of wild-type human G-CSF wherein at least one attachment site 

15 for a non-polypeptide moiety has been introduced in a position that in wild-type human 
G-CSF is occupied by a surface-exposed amino acid residue. 

23. A single-chain multimeric G-CSF polypeptide comprising at least two G-CSF polypep- 
tide monomers linked via a peptide bond or a peptide linker, wherein at least one of said 

20 monomers is hG-CSF with the amino acid sequence shown in SEQ ID NO: 1. 

24. The single-chain multimeric G-CSF polypeptide of claim 23, comprising two G-CSF 
polypeptide monomers, both of which have the amino acid sequence shown in SEQ ID 
NO:l. 

25 

25. A single-chain multimeric G-CSF polypeptide comprising at least two G-CSF polypep- 
tide monomers linked via a peptide linker, wherein the peptide linker comprises at least 
one amino acid residue comprising an attachment group for a non-polypeptide moiety. 

30 26. A single-chain multimeric G-CSF polypeptide conjugate comprising at least one non- 
polypeptide moiety covalently attached to a single-chain multimeric G-CSF polypeptide 
according to any of claims 19-25. 
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27. The polypeptide conjugate of claim 26, wherein the non-polypeptide moiety is a polymer 
molecule. 

28. The polypeptide conjugate of claim 27, wherein the polymer molecule is a linear or 
5 branched polyethylene glycol or another polyalkylene oxide. 

29. A nucleotide sequence encoding a single-chain polypeptide according to any of the pre- 
ceding claims. 

10 30. An expression vector comprising a nucleotide sequence according to claim 29. 

31. A recombinant host cell comprising a nucleotide sequence according to claim 29 or an 
expression vector according to claim 30. 

15 32. A method for preparing a single-chain multimeric polypeptide or polypeptide conjugate 
according to any of claims 1-28, comprising culturing a recombinant host cell according 
to claim 31 comprising a single nucleotide sequence encoding said polypeptide in a suit- 
able culture medium under conditions permitting expression of the nucleotide sequence, 
and recovering the resulting polypeptide from the cell culture, followed, where appropri- 

20 ate, by subjecting the polypeptide to conjugation with a non-polypeptide moiety under 
suitable reaction conditions to result in a polypeptide conjugate. 

33. A composition comprising a single-chain multimeric polypeptide or polypeptide conju- 
gate according to any of claims 1-28 together with at least one excipient or vehicle. 

25 

34. Use in therapy of a single-chain multimeric polypeptide or polypeptide conjugate ac- 
cording to any of claims 1-28. 

35. Use of a single-chain multimeric polypeptide or polypeptide conjugate according to any 
30 of claims 1-28 for the manufacture of a medicament. 



36. Use of single-chain multimeric polypeptide or polypeptide conjugate according to any of 
claims 19-28 for the manufacture of a medicament for treatment of general haematopoi- 
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etic disorders, including disorders arising from radiation therapy, chemotherapy or bone 
marrow transplantations, treatment of AIDS or other immunodeficiency diseases, treat- 
ment of leukopenia and treatment of acute myeloid leukemia. 

5 37. A method of treating a mammal having a general haematopoietic disorder, including 

disorders arising from radiation therapy, chemotherapy or bone marrow transplantations, 
treatment of AIDS or other immunodeficiency diseases, treatment of leukopenia and 
treatment of acute myeloid leukaemia, comprising administering to a mammal in need 
thereof an effective amount of a single-chain multimeric polypeptide or polypeptide con- 

10 jugate according to any of claims 19-28. 
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SEQ ID NO:l 

Human G-CSF 

TPLGPASSLPQSFLLKCLEQVRKIQGDGAALQEKLCATYKLCHPEELVLL 
GHSLGIPWAPLSSCPSQALQLAGCLSQLHSGLFLYQGLLQALEGISPELG 
PTLDTLQLDVADFATTIWQQMEELGMAPALQPTQGAMPAFASAFQRRAGG 
VLVASHLQSFLEVSYRVLRHLAQP 

SEQ ID NO: 2 

DNA encoding the YAP3 signal sequence 

ATGAAATTGAAAACTGTTAGATCTGCTGTTTTGTCTTCTTTGTTTGCTTCTCAAGTTTT 
GGGT 

SEQ ID NO: 3 

DNA encoding the TA57 leader sequence 

CAACCAATTGATGATACTGAATCTCAAACTACTTCTGTTAATTTGATGGCTGATGATAC 
TG AATC TGC TTTTGC T AC TC AAAC T AATTC TGGTGGTTTGG ATGTTGTTGGTTTGAT AT 
CGATGGCC 

SEQ ID NO: 4 

DNA encoding G-CSF copy 1 in the single chain G-CSF dimer 

ACTCCATTGGGTCCAGCTTCTTCTTTGCCACAATCTTTTTTGTTGAAATGTTTGGAACA 
AGTTAGAAAAATTCAAGGTGATGGTGCTGCTTTGCAAGAAAAATTGTGTGCTACTTATA 
AATTGTGTCATCCAGAAGAATTGGTTTTGTTGGGTCATTCTTTGGGTATTCCATGGGCT 
CCATTGTCTTCTTGTCCATCTCAAGCTTTGCAATTGGCTGGTTGTTTGTCTCAATTGCA 
TTCTGGTTTGTTTTTGTATCAAGGTTTGTTGCAAGCTTTGGAAGGTATTTCTCCAGAAT 
TGGGTCCAACTTTGGATACTTTGCAATTGGATGTTGCTGATTTTGCTACTACTATTTGG 
CAACAAATGGAAGAATTGGGTATGGCTCCAGCTTTGCAACCAACTCAAGGTGCTATGCC 
AGCTTTTGCTTCTGCTTTTCAAAGAAGAGCTGGTGGTGTTTTGGTTGCTTCTCATTTGC 
AATCTTTTTTGGAAGTTTCTTATAGAGTTTTGAGACATTTGGCTCAACCA 

SEQ ID NO: 5 

DNA encoding G-CSF copy 2 in the single chain G-CSF dimer 

ACCCCTCTGGGCCCGGCCAGCAGTCTGCCTCAGAGTTTTTTACTGAAATGCTTAGAACA 
GGTGCGTAAAATCCAGGGCGATGGCGCGGCCCTGCAGGAAAAACTGTGCGCGACCTATA 
AACTGTGCCATCCTGAAGAACTGGTCCTGTTAGGCCATAGCTTAGGCATCCCGTGGGCG 
CCTCTGAGTAGCTGCCCGAGTCAGGCCCTGCAGCTGGCCGGCTGCCTGAGTCAGTTACA 
TAGTGGCTTATTTTTATATCAGGGCTTACTGCAGGCGTTAGAAGGCATTAGTCCGGAAC 
TGGGCCCGACCCTGGATACCTTACAGTTAGATGTCGCGGATTTTGCCACCACCATTTGG 
CAGCAGATGGAAGAATTAGGCATGGCGCCTGCGTTACAGCCTACCCAGGGCGCCATGCC 
TGCGTTTGCGAGTGCGTTTCAGCGTCGCGCCGGCGGCGTGTTAGTGGCCAGCCATCTGC 
AGAGCTTTCTGGAAGTGAGTTATCGTGTGTTACGCCATCTGGCCCAGCCTTAATCTAGA 



1/3 
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SEQ ID NO: 6 

Single chain G-CSF dimer polypeptide 

TPLGPASSLPQSFLLKCLEQVRKIQGDGAALQEKLCATYKLCHPEELVLLGHSLGIPWA 
PLSSCPSQALQLAGCLSQLHSGLFLYQGLLQALEGISPELGPTLDTLQLDVADFATTIW 
QQMEELGMAPALQPTQGAMPAFASAFQRRAGGVLVASHLQSFLEVSYRVLRHLAQPTPL 
GPASSLPQSFLLKCLEQVRKIQGDGAALQEKLCATYKLCHPEELVLLGHSLGIPWAPLS 
SCPSQALQLAGCLSQLHSGLFLYQGLLQALEGISPELGPTLDTLQLDVADFATTIWQQM 
EELGMAPALQPTQGAMPAFASAFQRRAGGVLVASHLQSFLEVSYRVLRHLAQP 

SEQ ID NO: 7 

DNA encoding human G-CSF signal peptide 

ATGGCTGGACCTGCCACCCAGAGCCCCATGAAGCTGATGGCCCTGCAGCTGCTGCTGTG 
GCACAGTGCACTCTGGACAGTGCAGGAAGCC 



SEQ ID NO: 8 

DNA encoding single-chain G-CSF copy 1 (codon usage 
optimized for expression in CHO cells) 

ACTCCATTGGGTCCAGCTTCTTCTTTGCCACAATCTTTTTTGTTGAAATGTTTGGAACA 
AGTTAGAAAAATTCAAGGTGATGGTGCTGCTTTGCAAGAAAAATTGTGTGCTACTTATA 
AATTGTGTCATCCAGAAGAATTGGTTTTGTTGGGTCATTCTTTGGGTATTCCATGGGCT 
CCATTGTCTTCTTGTCCATCTCAAGCTTTGCAATTGGCTGGTTGTTTGTCTCAATTGCA 
TTCTGGTTTGTTTTTGTATCAAGGTTTGTTGCAAGCTTTGGAAGGTATTTCTCCAGAAT 
TGGGTCCAACTTTGGATACTTTGCAATTGGATGTTGCTGATTTTGCTACTACTATTTGG 
CAACAAATGGAAGAATTGGGTATGGCTCCAGCTTTGCAACCAACTCAAGGTGCTATGCC 
AGCTTTTGCTTCTGCTTTTCAAAGAAGAGCTGGTGGTGTTTTGGTTGCTTCTCATTTGC 
AATCTTTTTTGGAAGTTTCTTATAGAGTTTTGAGACATTTGGCTCAACCA 



SEQ ID NO: 9 

Synthetic tag 

His-His-His-His-His-His 



SEQ ID NO: 10 

Synthetic tag 



Met-Lys -His-His-His-His-His-His 



SEQ ID NO: 11 

Synthetic tag 

Met-Lys -His -His -Ala-Hi s -Hi s-Gln-His-His 
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SEQ ID NO: 12 

Synthetic tag 

Met-Lys-His-Gln-His-Gln-His-Gln-His-Gln-His-Gln-His-Gln 



SEQ ID NO: 13 

Synthetic tag 

Met-Lys-His-Gln-His-Gln-His-Gln-His-Gln-His-Gln-His-Gln-Gln 
SEQ ID NO: 14 

C-terminal tag (Mol . Cell. Biol. 5:3610-16, 1985) 



EQKLISEEDL 



SEQ ID NO: 15 

C- or N-terminal tag 



DYKDDDDK 



SEQ ID NO: 16 

Synthetic tag 

YPYDVPDYA 
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SEQUENCE LISTING 



<110> Maxygen ApS 

Maxygen Holdings Ltd. 



<±z u> 


Single— Chain Polypeptides 


>1 j U-> 


Uzl o wo ziu 


<150> 


DK PA 2000 01647 


<151> 


2000-11-02 


<160> 


16 


<170> 


Patentln version 3.1 


<210> 


1 


<211> 


174 


<212> 


PRT 


<213> 


Homo sapiens 


<400> 


1 


Thr Pro Leu Gly Pro Ala Ser Ser Leu 



10 15 



Cys Leu Glu Gin Val Arg Lys lie Gin Gly Asp Gly Ala Ala Leu Gin 
20 25 30 



Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val 
35 40 45 



Leu Leu Gly His Ser Leu Gly lie Pro Trp Ala Pro Leu Ser Ser Cys 
50 55 60 



Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser 
65 70 75 80 



Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly lie Ser 
85 90 95 



Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp 
100 105 110 



Phe Ala Thr Thr lie Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro 
115 120 125 



Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe 
130 135 140 



Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe 
145 150 155 160 
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Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro 
165 170 



<210> 


2 


<211> 


63 


<212> 


DNA 


<213> 


Saccharomyces cerevisiae 


<400> 


2 


atgaaattga aaactgttag atctgctgtt 


ggt 




<210> 


3 


<211> 


126 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


leader sequence 


<400> 


3 



60 
63 



caaccaattg 


atgatactga 


atctcaaact 


acttctgtta 


atttgatggc 


tgatgatact 


60 


gaatctgctt 


ttgctactca 


aactaattct 


ggtggtttgg 


atgttgttgg 


tttgatatcg 


120 


atggcc 












126 


<210> 4 
<211> 522 
<212> DNA 

<213> Artificial Sequence 










<220> 

<223> DNA 


encoding G- 


-CSF copy 1 


in the single chain G- 


-CSF dirtier 




<400> 4 
actccattgg 


gtccagcttc 


ttctttgcca 


caatcttttt 


tgttgaaatg 


tttggaacaa 


60 


gttagaaaaa 


ttcaaggtga 


tggtgctgct 


ttgcaagaaa 


aattgtgtgc 


tacttataaa 


120 


ttgtgtcatc 


cagaagaatt 


ggttttgttg 


ggtcattctt 


tgggtattcc 


atgggctcca 


180 


ttgtcttctt 


gtccatctca 


agctttgcaa 


ttggctggtt 


gtttgtctca 


attgcattct 


240 


ggtttgtttt 


tgtatcaagg 


tttgttgcaa 


gctttggaag 


gtatttctcc 


agaattgggt 


300 


ccaactttgg 


atactttgca 


attggatgtt 


gctgattttg 


ctactactat 


ttggcaacaa 


360 


atggaagaat 


tgggtatggc 


tccagctttg 


caaccaactc 


aaggtgctat 


gccagctttt 


42 0 


gcttctgctt 


ttcaaagaag 


agctggtggt 


gttttggttg 


cttctcattt 


gcaatctttt 


480 


ttggaagttt 


cttatagagt 


tttgagacat 


ttggctcaac 


ca 
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<210> 5 
<211> 531 
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<212> DNA 

<213> Artificial Sequence 










<220> 

<223> DNA 


encoding G- 


-CSF copy 2 


in the single chain G- 


-CSF dimer 




<400> 5 


rr r~* r~* rrrrr^ t~* ^ rr 




cagagttttt 


tactgaaatg 


cttagaacag 


60 


y uyuy o cxa. ca.cx 


ui-^ay yyuya 




ctgcaggaaa 


aactgtgcgc 


gacctataaa 


120 


ptcrt crcCr? t" r* 

V_- y Vw-» C4, *w 


c t ct3 ^ cr;=a art" 


rrrrt" pp i - rrt* t* f\ 

y y ^v_. i_ y t_- i_ a. 


ggecataget 


taggcatccc 


gtgggcgcct 


180 


c fccrr3 crt* 3 err 1 t~ 


cr n r* r* rr a rr t"C r5 


arte pp 1~ aPri rr 


ctggccggct 


gectgagtea 


gttacatagt 


240 


ggcttatttt 


tatatcaggg 


ettactgeag 


gcgttagaag 


gcattagtcc 


ggaactgggc 


300 


ccgaccctgg 


ataccttaca 


gttagatgtc 


gcggattttg 


ccaccaccat 


u y udy dy 


7 £ n 

J D U 


atggaagaat 


taggcatggc 


gectgegtta 


cagcctaccc 


agggegecat 


gcctgcgttt 


420 


gcgagtgcgt 


ttcagegteg 


cgccggcggc 


gtgttagtgg 


ccagccatct 


gcagagcttt 


480 


ctggaagtga 


gttatcgtgt 


gttacgecat 


ctggcccagc 


cttaatctag 


a 


531 



<210> 6 
<211> 348 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Single chain G-CSF dimer polypeptide 
<400> 6 

Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys 
15 10 15 



Cys Leu Glu Gin Val Arg Lys lie Gin Gly Asp Gly Ala Ala Leu Gin 
20 25 30 



Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val 
35 40 45 



Leu Leu Gly His Ser Leu Gly lie Pro Trp Ala Pro Leu Ser Ser Cys 
50 55 60 



Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser 
65 70 75 80 



Gly Leu Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly He Ser 
85 90 95 



Pro Glu Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp 
100 105 110 
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Phe Ala Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro 
115 120 125 



Ala Leu Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe 
130 135 140 



Gin Arg Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe 
145 150 155 160 



Leu Glu Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro Thr Pro 
165 170 175 



Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Leu Leu Lys Cys Leu 
180 185 190 



Glu Gin Val Arg Lys He Gin Gly Asp Gly Ala Ala Leu Gin Glu Lys 
195 200 205 



Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu Glu Leu Val Leu Leu 
210 215 220 



Gly His Ser Leu Gly He Pro Trp Ala Pro Leu Ser Ser Cys Pro Ser 
225 230 235 240 



Gin Ala Leu Gin Leu Ala Gly Cys Leu Ser Gin Leu His Ser Gly Leu 
245 250 255 



Phe Leu Tyr Gin Gly Leu Leu Gin Ala Leu Glu Gly He Ser Pro Glu 
260 265 270 



Leu Gly Pro Thr Leu Asp Thr Leu Gin Leu Asp Val Ala Asp Phe Ala 
275 280 285 



Thr Thr He Trp Gin Gin Met Glu Glu Leu Gly Met Ala Pro Ala Leu 
290 295 300 



Gin Pro Thr Gin Gly Ala Met Pro Ala Phe Ala Ser Ala Phe Gin Arg 
305 310 315 320 



Arg Ala Gly Gly Val Leu Val Ala Ser His Leu Gin Ser Phe Leu Glu 
325 330 335 



Val Ser Tyr Arg Val Leu Arg His Leu Ala Gin Pro 
340 345 
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<210> 7 

<211> 90 

<212> DNA 

<213> Homo sapiens 

<400> 7 

atggctggac ctgccaccca gagccccatg aagctgatgg ccctgcagct gctgctgtgg 60 
cacagtgcac tctggacagt gcaggaagcc 90 



<210> 8 

<211> 522 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> DNA encoding single-chain G-CSF copy 1 (codon usage optimized for 
expression in CHO cells) 



<400> 8 
actccattgg 


gtccagcttc 


ttctttgcca 


caatcttttt 


tgttgaaatg 


tttggaacaa 


60 


gttagaaaaa 


ttcaaggtga 


tggtgctgct 


ttgcaagaaa 


aattgtgtgc 


tacttataaa 


120 


ttgtgtcatc 


cagaagaatt 


ggttttgttg 


ggtcattctt 


tgggtattcc 


atgggctcca 


180 


ttgtcttctt 


gtccatctca 


agctttgcaa 


ttggctggtt 


gtttgtctca 


attgcattct 


240 


ggtttgtttt 


tgtatcaagg 


tttgttgcaa 


gctttggaag 


gtatttctcc 


agaattgggt 


300 


ccaactttgg 


atactttgca 


attggatgtt 


gctgattttg 


ctactactat 


ttggcaacaa 


360 


atggaagaat 


tgggtatggc 


tccagctttg 


caaccaactc 


aaggtgctat 


gccagctttt 


420 


gcttctgctt 


ttcaaagaag 


agctggtggt 


gttttggttg 


cttctcattt 


gcaatctttt 


480 


ttggaagttt 


cttatagagt 


tttgagacat 


ttggctcaac 


ca 




522 



<210> 9 

<211> 6 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> tag 

<400> 9 

His His His His His His 
1 5 



<210> 10 

<211> 8 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> tag 
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<400> 10 

Met Lys His His His His His His 



<210> 
<211> 
<212> 
<213> 


11 
10 
PRT 

Artificial Sequence 


<220> 
<223> 


tag 


<400> 


11 


Met Lys His His Ala His His 
1 5 


<210> 
<211> 
<212> 
<213> 


12 
14 
PRT 

Artificial Sequence 


<220> 
<223> 


tag 


<400> 


12 


Met Lys His Gin His Gin His 
1 5 


<210> 
<211> 
<212> 
<213> 


13 
15 
PRT 

Artificial Sequence 


<220> 
<223> 


tag 


<400> 


13 


Met Lys His Gin His Gin His 
1 5 


<210> 
<211> 
<212> 
<213> 


14 
10 
PRT 

Artificial Sequence 


<220> 
<223> 


tag 


<400> 


14 



10 



10 



10 15 



Glu Gin Lys Leu lie Ser Glu Glu Asp Leu 
15 10 
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<210> 15 

<211> 8 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> tag 

<400> 15 

Asp Tyr Lys Asp Asp Asp Asp Lys 
1 5 



<210> 16 

<211> 9 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> tag 

<400> 16 

Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
1 5 



7 



INTERNATIONAL SEARCH REPORT 



lnterri|^pl Application No 

?CjWk 01/00724 



A. CLASSIFICATION OF SUBJECT MATTER ' 

IPC 7 C07K14/535 A61K47/48 C07K14/52 C12N15/63 A61P7/00 
A61K38/19 

According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 

IPC 7 C07K A61K 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 

EPO-Internal , WPI Data, PAJ, CHEM ABS Data, BIOSIS, MEDLINE, EMBASE 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category ° 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


X 


US 5 684 136 A (G0DOWSKI PAUL J) 
4 November 1997 (1997-11-04) 
cited in the application 
abstract 
column 5 

column 13 -column 15 


1-37 


X 


WO 99 02710 A (BETH ISRAEL HOSPITAL 
;SYTKOWSKI ARTHUR J (US)) 
21 January 1999 (1999-01-21) 
cited in the application 
page 25 
page 18-23 


1-37 


X 


EP 0 783 003 A (KY0WA HAKKO K0GY0 KK) 
9 July 1997 (1997-07-09) 
page 6 

_/__ 


1-37 



Ls 



Further documents are listed in the continuation of box C. 



Patent family members are listed in annex. 



° Special categories of cited documents : 

"A" document defining the general state of the art which Is not 
considered to be of particular relevance 

"E" earlier document but published on or after the international 
filing date 

'L' document which may throw doubts on priority claim(s) or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

"O* document referring to an oral disclosure, use, exhibition or 
other means 

■P' document published prior to the international filing date but 
later than the priority date claimed 



'T' later document published after the international filing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

"X" document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

"Y" document of particular relevance; the claimed invention 

cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

document member of the same patent family 



Date of the actual completion of the international search 

19 March 2002 


Date of mailing of the international search report 

28/03/2002 


Name and mailing address of the ISA 

European Patent Office, P.B. 5818 Patentlaan 2 
ML - 2280 HV Rijsw/jk 
Tel. (+31-70) 340-2040, Tx. 31 651 epo nl, 
Fax: (+31-70) 340-3016 


Authorized officer 

Cervigni, S j 



Form PCT/ISA/210 (second sheet) (Jul/ 1992) 



page 1 of 2 



INTERNATIONAL SEARCH REPORT 



C.(Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT 


Category ° 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 



Intem^feia) Application No 

PCT/m 01/00724 



WO 96 05224 A (UNIV WASHINGTON) 
22 February 1996 (1996-02-22) 
page 3 
page 9 
page 23 

W0 96 40772 A (JOHNSON & JOHNSON ; JOHNSON 
DANA L (US); ZIVIN ROBERT A (US)) 
19 December 1996 (1996-12-19) 
abstract 

FRANCIS G E ET AL: "PEGYLATI0N OF 
CYTOKINES AND OTHER THERAPEUTIC PROTEINS 
AND PEPTIDES: THE IMPORTANCE OF BIOLOGICAL 
OPTIMISATION OF COUPLING TECHNIQUES" 
INTERNATIONAL JOURNAL OF 

HEMATOLOGY, ELSEVIER SCIENCE PUBLISHERS, NL, 
vol. 68, no. 1, 1998, pages 1-18, 
XP000907109 
ISSN: 0925-5710 
the whole document 

W0 94 17185 A (AMGEN INC) 
4 August 1994 (1994-08-04) 
page 9; claim 21 

WO 96 11953 A (AMGEN INC) 
25 April 1996 (1996-04-25) 
abstract 

W0 99 67291 A (PETTIT DEAN K ;IMMUNEX CORP 
(US)) 29 December 1999 (1999-12-29) 
abstract 



1-37 



1-37 



1-37 



1-37 



1-37 



1-37 



Form PCT/ISA/210 (continuation of second sheet) (July 1992) 

page 2 of 2 



International Application No. PCT/DK 01 £0724 



FURTHER INFORMATION CONTINUED FROM PCX/ISA/ 210 



Continuation of Box 1.2 
Claims Nos. : 1-18,29-35 



Present claims 1-18,29-35 relate to an extremely large number of possible 
compounds. Support within the meaning of Article 6 PCT and disclosure 
within the meaning of Article 5 PCT is to be found, however, for only a 
very small proportion of the compounds claimed. In the present case, the 
claims so lack support, and the application so lacks disclosure, that a 
meaningful search over the whole of the claimed scope is impossible. 
Consequently, the search has been carried out for the general concept 
underlying the application and for those parts of the claims which appear 
to be supported and disclosed, namely those parts relating to 
single-chain multimeric G-CSF polypeptides as characterised in claims 
19-28. Claims 1-18,29-35 have been seached in so far they relate to the 
same subject-matter 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 



INTERNATIONAL SEARCH REPORT 

matlon on patent family members 



Intern; 

PCT. 



TDK 



ial Application No 

k 01/00724 



Patent document 


Publication 




Patent family 


Publication 


cited in search report 


date 




member(s) 


date 


US 5684136 A 


04-11-1997 


US 


5316921 A 


31-05-1994 




US 


5328837 A 


12-07-1994 




US 


5763584 A 


09-06-1998 




EP 


0642585 Al 


15-03-1995 




OP 


7508178 T 


14-09-1995 




WO 


9323550 A2 


25-1 1-1993 




US 


5770704 A 


23-06-1998 




EP 


0642580 Al 


15-03-1995 




JP 


7508420 T 


21-09-1995 




US 


5547856 A 


20-08-1996 




WO 


9323541 Al 


25-11-1993 




US 


5580963 A 


03-12-1996 


WO 9902710 A 


21-01-1999 


US 


6242570 Bl 


05-06-2001 




US 


6187564 Bl 


13-02-2001 




AU 


732857 B2 


03-05-2001 




AU 


8383498 A 


08-02-1999 




EP 


1002106 Al 


24-05-2000 




WO 


9902710 Al 


21-01-1999 


EP 0783003 A 


09-07-1997 


AU 


705064 B2 


13-05-1999 




AU 


5514796 A 


18-11-1996 




EP 


0783003 Al 


09-07-1997 




CA 


2194070 Al 


31-10-1996 




WO 


9634016 Al 


31-10-1996 


W0 9605224 A 


22-02-1996 


AU 


— 

703718 B2 


01-04-1999 




AU 


3206695 A 


07-03-1996 




BR 


9506313 A 


05-08-1997 




CA 


2173750 Al 


22-02-1996 




CN 


1131952 A 


25-09-1996 




EP 


0725795 Al 


14-08-1996 




FI 


961600 A 


11-04-1996 




HU 


75900 A2 


28-05-1997 




JP 


9505741 T 


10-06-1997 




NO 


961433 A 


11-06-1996 




TR 


960134 A2 


21-06-1996 




WO 


9605224 Al 


22-02-1996 




US 


6242580 Bl 


05-06-2001 




US 


5705478 A 


06-01-1998 




US 


5958737 A 


28-09-1999 




US 


6028177 A 


22-02-2000 




us 


6238890 Bl 


29-05-2001 


W0 9640772 A 


19-12-1996 


us 


5767078 A 


16-06-1998 




AU 


732294 B2 


12-04-2001 




AU 


6100796 A 


30-12-1996 




CA 


2228277 Al 


19-12-1996 




EP 


0892812 A2 


27-01-1999 




JP 


2000513320 T 


10-10-2000 




WO 


9640772 A2 


19-12-1996 


WO 9417185 A 


04-08-1994 


US 


5581476 A 


03-12-1996 




AT 


195552 T 


15-09-2000 




AU 


697451 B2 


08-10-1998 




AU 


6095794 A 


15-08-1994 




AU 


8197098 A 


15-10-1998 




AU 


8197198 A 


15-10-1998 



Form PCT/ISA/210 (patent family annex) (July 1992) 



page 1 of 2 



INTERNATIONAL SEARCH REPORT 

matlon on patent family members 



Intern 

PCT 



TDK 



al Application No 

K 01/00724 



Patent document 


Publication 




Patent family 




Publication 


cited in search report 


date 




marnhpr^ 






WO 9417185 A 




CA 


2153896 


Al 


04-08-1994 




CA 


2239360 


Al 


04-08-1994 




CN 


1132527 


A 


02-10-1996 




DE 


69425516 


Dl 


21-09-2000 




DE 


69425516 


T2 


13-06-2001 




DK 


612846 


T3 


09-10-2000 




EP 


0612846 


Al 


31-08-1994 




EP 


0890640 


A2 


13-01-1999 




EP 


0965638 


A2 


22-12-1999 




EP 


0974655 


A2 


26-01-2000 




ES 


2150951 


T3 


16-12-2000 




FI 


953593 


A 


27-07-1995 




JP 


8506018 


T 


02-07-1996 




LV 


12642 


A 


20-04-2001 




NO 


952974 


A 


28-09-1995 




NZ 


261765 


A 


24-10-1997 




M~7 

NZ 


314960 


A 


28-01-1999 




PT 


612846 


T 


31-01-2001 




WO 


9417185 


Al 


04-08-1994 




US 


6261550 


Bl 


17-07-2001 




US 


5790421 


A 


04-08-1998 




US 


2001016191 


Al 


23-08-2001 




ZA 


9400568 


A 


12-09-1994 


WO 9611953 A 


25-04-1996 


US 


5824784 


A 


20-10-1998 




AT 


179991 


T 


15-05-1999 




AU 


706700 


B2 


24-06-1999 




AU 


1841995 


A 


06-05-1996 




AU 


4887099 


A 


11-11-1999 




CA 


2178752 


Al 


25-04-1996 




CA 


2307142 


Al 


25-04-1996 




CN 


1313343 


A 


19-09-2001 




CN 


1139932 


A ,B 


08-01-1997 




DE 


69509628 


Dl 


17-06-1999 




DE 


69509628 


T2 


16-09-1999 




DK 


733067 


T3 


01-11-1999 




EP 


0733067 


Al 


25-09-1996 




EP 


0822199 


A2 


04-02-1998 




ES 


2131811 


T3 


01-08-1999 




GR 


3030526 


T3 


29-10-1999 




HK 


1008826 


Al 


31-03-2000 




IL 


112585 


A 


31-08-2000 




JP 


11310600 


A 


09-11-1999 




JP 


3177449 


B2 


18-06-2001 




JP 


9025298 


A 


28-01-1997 




JP 


9506116 


T 


17-06-1997 




JP 


r» -i nor t 

3177251 


B2 


18-06-2001 




KR 


248111 


Bl 


15-03-2000 




KR 


261030 


Bl 


15-06-2000 




NZ 


281469 


A 


24-04-1997 




i in 

WO 


9611953 


Al 


25-04-1996 




US 


5985265 


A 


16-11-1999 




ZA 


9501008 


A 


18-10-1996 


WO 9967291 A 


29-12-1999 


AU 


4702099 


A 


10-01-2000 




EP 


1090036 


A2 


11-04-2001 




WO 


9967291 


A2 


29-12-1999 





Form PCT/I3A/210 (patent family annex) (July 1992) 



page 2 of 2 



